Skip to main content

Showing 1–50 of 51 results for author: Gerstner, W

.
  1. arXiv:2506.14951  [pdf, ps, other

    cs.LG cs.AI cs.NE

    Flat Channels to Infinity in Neural Loss Landscapes

    Authors: Flavio Martinelli, Alexander Van Meegen, Berfin Şimşek, Wulfram Gerstner, Johanni Brea

    Abstract: The loss landscapes of neural networks contain minima and saddle points that may be connected in flat regions or appear in isolation. We identify and characterize a special structure in the loss landscape: channels along which the loss decreases extremely slowly, while the output weights of at least two neurons, $a_i$ and $a_j$, diverge to $\pm$infinity, and their input weight vectors,… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  2. arXiv:2311.01644  [pdf, other

    cs.LG cs.NE stat.ML

    Should Under-parameterized Student Networks Copy or Average Teacher Weights?

    Authors: Berfin Şimşek, Amire Bendjeddou, Wulfram Gerstner, Johanni Brea

    Abstract: Any continuous function $f^*$ can be approximated arbitrarily well by a neural network with sufficiently many neurons $k$. We consider the case when $f^*$ itself is a neural network with one hidden layer and $k$ neurons. Approximating $f^*$ with a neural network with $n< k$ neurons can thus be seen as fitting an under-parameterized "student" network with $n$ neurons to a "teacher" network with… ▽ More

    Submitted 15 January, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 41 pages, presented at NeurIPS 2023

  3. arXiv:2306.08744  [pdf, other

    cs.NE cs.LG

    High-performance deep spiking neural networks with 0.3 spikes per neuron

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Communication by rare, binary spikes is a key factor for the energy efficiency of biological brains. However, it is harder to train biologically-inspired spiking neural networks (SNNs) than artificial neural networks (ANNs). This is puzzling given that theoretical results provide exact mapping algorithms from ANNs to SNNs with time-to-first-spike (TTFS) coding. In this paper we analyze in theory a… ▽ More

    Submitted 20 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  4. arXiv:2306.03603  [pdf, other

    q-bio.NC

    Trial matching: capturing variability with data-constrained spiking neural networks

    Authors: Christos Sourmpis, Carl Petersen, Wulfram Gerstner, Guillaume Bellec

    Abstract: Simultaneous behavioral and electrophysiological recordings call for new methods to reveal the interactions between neural activity and behavior. A milestone would be an interpretable model of the co-variability of spiking activity and behavior across trials. Here, we model a mouse cortical sensory-motor pathway in a tactile detection task reported by licking with a large recurrent spiking neural… ▽ More

    Submitted 1 December, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 12 pages of main text, 4 figures in main, 5 pages of appendix, 5 figures in appendix

  5. arXiv:2306.01690  [pdf, other

    cs.LG cs.AI

    Context selectivity with dynamic availability enables lifelong continual learning

    Authors: Martin Barry, Wulfram Gerstner, Guillaume Bellec

    Abstract: "You never forget how to ride a bike", -- but how is that possible? The brain is able to learn complex skills, stop the practice for years, learn other skills in between, and still retrieve the original knowledge when necessary. The mechanisms of this capability, referred to as lifelong learning (or continual learning, CL), are unknown. We suggest a bio-plausible meta-plasticity rule building on c… ▽ More

    Submitted 25 January, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

  6. arXiv:2304.12794  [pdf, other

    cs.NE

    Expand-and-Cluster: Parameter Recovery of Neural Networks

    Authors: Flavio Martinelli, Berfin Simsek, Wulfram Gerstner, Johanni Brea

    Abstract: Can we identify the weights of a neural network by probing its input-output mapping? At first glance, this problem seems to have many solutions because of permutation, overparameterisation and activation function symmetries. Yet, we show that the incoming weight vector of each neuron is identifiable up to sign or scaling, depending on the activation function. Our novel method 'Expand-and-Cluster'… ▽ More

    Submitted 27 June, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted paper at ICML '24

  7. arXiv:2303.05174  [pdf, other

    q-bio.NC

    Emergent rate-based dynamics in duplicate-free populations of spiking neurons

    Authors: Valentin Schmutz, Johanni Brea, Wulfram Gerstner

    Abstract: Can Spiking Neural Networks (SNNs) approximate the dynamics of Recurrent Neural Networks (RNNs)? Arguments in classical mean-field theory based on laws of large numbers provide a positive answer when each neuron in the network has many "duplicates", i.e. other neurons with almost perfectly correlated inputs. Using a disordered network model that guarantees the absence of duplicates, we show that d… ▽ More

    Submitted 7 November, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 29 pages, 6 figures

  8. arXiv:2301.10638  [pdf, ps, other

    cs.LG

    MLPGradientFlow: going with the flow of multilayer perceptrons (and finding minima fast and accurately)

    Authors: Johanni Brea, Flavio Martinelli, Berfin Şimşek, Wulfram Gerstner

    Abstract: MLPGradientFlow is a software package to solve numerically the gradient flow differential equation $\dot θ= -\nabla \mathcal L(θ; \mathcal D)$, where $θ$ are the parameters of a multi-layer perceptron, $\mathcal D$ is some data set, and $\nabla \mathcal L$ is the gradient of a loss function. We show numerically that adaptive first- or higher-order integration methods based on Runge-Kutta schemes h… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

  9. arXiv:2212.12522  [pdf, other

    cs.NE cs.LG

    An Exact Mapping From ReLU Networks to Spiking Neural Networks

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Deep spiking neural networks (SNNs) offer the promise of low-power artificial intelligence. However, training deep SNNs from scratch or converting deep artificial neural networks to SNNs without loss of performance has been a challenge. Here we propose an exact mapping from a network with Rectified Linear Units (ReLUs) to an SNN that fires exactly one spike per neuron. For our constructive proof,… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  10. A taxonomy of surprise definitions

    Authors: Alireza Modirshanechi, Johanni Brea, Wulfram Gerstner

    Abstract: Surprising events trigger measurable brain activity and influence human behavior by affecting learning, memory, and decision-making. Currently there is, however, no consensus on the definition of surprise. Here we identify 18 mathematical definitions of surprise in a unifying framework. We first propose a technical classification of these definitions into three groups based on their dependence on… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: To appear in the Journal of Mathematical Psychology

    Journal ref: Journal of Mathematical Psychology Volume 110, September 2022, 102712

  11. arXiv:2208.09416  [pdf, other

    cs.NE

    Kernel Memory Networks: A Unifying Framework for Memory Modeling

    Authors: Georgios Iatropoulos, Johanni Brea, Wulfram Gerstner

    Abstract: We consider the problem of training a neural network to store a set of patterns with maximal noise robustness. A solution, in terms of optimal weights and state update rules, is derived by training each individual neuron to perform either kernel classification or interpolation with a minimum weight norm. By applying this method to feed-forward and recurrent networks, we derive optimal models, term… ▽ More

    Submitted 23 July, 2024; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures. This is the version published in the NeurIPS 2022 proceedings

  12. arXiv:2205.13493  [pdf, other

    q-bio.NC cs.LG stat.ML

    Mesoscopic modeling of hidden spiking neurons

    Authors: Shuqi Wang, Valentin Schmutz, Guillaume Bellec, Wulfram Gerstner

    Abstract: Can we use spiking neural networks (SNN) as generative models of multi-neuronal recordings, while taking into account that most neurons are unobserved? Modeling the unobserved neurons with large pools of hidden spiking neurons leads to severely underconstrained problems that are hard to tackle with maximum likelihood estimation. In this work, we use coarse-graining and mean-field approximations to… ▽ More

    Submitted 7 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 23 pages, 7 figures

  13. arXiv:2106.10064  [pdf, other

    stat.ML cs.LG q-bio.NC

    Fitting summary statistics of neural data with a differentiable spiking network simulator

    Authors: Guillaume Bellec, Shuqi Wang, Alireza Modirshanechi, Johanni Brea, Wulfram Gerstner

    Abstract: Fitting network models to neural activity is an important tool in neuroscience. A popular approach is to model a brain area with a probabilistic recurrent spiking network whose parameters maximize the likelihood of the recorded activity. Although this is widely used, we show that the resulting model does not produce realistic neural activity. To correct for this, we suggest to augment the log-like… ▽ More

    Submitted 14 November, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  14. arXiv:2105.12221  [pdf, other

    cs.LG

    Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances

    Authors: Berfin Şimşek, François Ged, Arthur Jacot, Francesco Spadaro, Clément Hongler, Wulfram Gerstner, Johanni Brea

    Abstract: We study how permutation symmetries in overparameterized multi-layer neural networks generate `symmetry-induced' critical points. Assuming a network with $ L $ layers of minimal widths $ r_1^*, \ldots, r_{L-1}^* $ reaches a zero-loss minimum at $ r_1^*! \cdots r_{L-1}^*! $ isolated points that are permutations of one another, we show that adding one extra neuron to each layer is sufficient to conn… ▽ More

    Submitted 12 September, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 29 pages, 12 figures, ICML 2021

  15. arXiv:2105.10109  [pdf, other

    q-bio.NC

    Correlation-invariant synaptic plasticity

    Authors: Carlos Stein N. Brito, Wulfram Gerstner

    Abstract: Cortical populations of neurons develop sparse representations adapted to the statistics of the environment. While existing synaptic plasticity models reproduce some of the observed receptive-field properties, a major obstacle is the sensitivity of Hebbian learning to omnipresent spurious correlations in cortical networks which can overshadow relevant latent input features. Here we develop a theor… ▽ More

    Submitted 15 September, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

  16. arXiv:2010.08262  [pdf, other

    cs.NE cs.AI cs.AR cs.CV cs.LG

    Local plasticity rules can learn deep representations using self-supervised contrastive predictions

    Authors: Bernd Illing, Jean Ventura, Guillaume Bellec, Wulfram Gerstner

    Abstract: Learning in the brain is poorly understood and learning rules that respect biological constraints, yet yield deep hierarchical representations, are still unknown. Here, we propose a learning rule that takes inspiration from neuroscience and recent advances in self-supervised deep learning. Learning minimizes a simple layer-specific loss function and does not need to back-propagate error signals wi… ▽ More

    Submitted 25 October, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  17. Paradoxical Results of Long-Term Potentiation explained by Voltage-based Plasticity Rule

    Authors: Claire Meissner-Bernard, Matthias Tsai, Laureline Logiaco, Wulfram Gerstner

    Abstract: Experiments have shown that the same stimulation pattern that causes Long-Term Potentiation in proximal synapses, will induce Long-Term Depression in distal ones. In order to understand these, and other, surprising observations we use a phenomenological model of Hebbian plasticity at the location of the synapse. Our computational model describes the Hebbian condition of joint activity of pre- and… ▽ More

    Submitted 18 November, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

    Journal ref: Front. Synaptic Neurosci. (2020) 12:585539

  18. arXiv:1910.10559  [pdf, other

    q-bio.NC cs.NE

    Working memory facilitates reward-modulated Hebbian learning in recurrent neural networks

    Authors: Roman Pogodin, Dane Corneil, Alexander Seeholzer, Joseph Heng, Wulfram Gerstner

    Abstract: Reservoir computing is a powerful tool to explain how the brain learns temporal sequences, such as movements, but existing learning schemes are either biologically implausible or too inefficient to explain animal performance. We show that a network can learn complicated sequences with a reward-modulated Hebbian learning rule if the network of reservoir neurons is combined with a second network tha… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 workshop "Real Neurons & Hidden Units: Future directions at the intersection of neuroscience and artificial intelligence", Vancouver, Canada

  19. arXiv:1907.02936  [pdf, other

    stat.ML cs.LG q-bio.NC stat.AP

    Learning in Volatile Environments with the Bayes Factor Surprise

    Authors: Vasiliki Liakoni, Alireza Modirshanechi, Wulfram Gerstner, Johanni Brea

    Abstract: Surprise-based learning allows agents to rapidly adapt to non-stationary stochastic environments characterized by sudden changes. We show that exact Bayesian inference in a hierarchical model gives rise to a surprise-modulated trade-off between forgetting old observations and integrating them with the new ones. The modulation depends on a probability ratio, which we call "Bayes Factor Surprise", t… ▽ More

    Submitted 23 September, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

  20. arXiv:1907.02911  [pdf, other

    cs.LG stat.ML

    Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

    Authors: Johanni Brea, Berfin Simsek, Bernd Illing, Wulfram Gerstner

    Abstract: The permutation symmetry of neurons in each layer of a deep neural network gives rise not only to multiple equivalent global minima of the loss function, but also to first-order saddle points located on the path between the global minima. In a network of $d-1$ hidden layers with $n_k$ neurons in layers $k = 1, \ldots, d$, we construct smooth paths between equivalent global minima that lead through… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

  21. Biologically plausible deep learning -- but how far can we go with shallow networks?

    Authors: Bernd Illing, Wulfram Gerstner, Johanni Brea

    Abstract: Training deep neural networks with the error backpropagation algorithm is considered implausible from a biological perspective. Numerous recent publications suggest elaborate models for biologically plausible variants of deep learning, typically defining success as reaching around 98% test accuracy on the MNIST data set. Here, we investigate how far we can go on digit (MNIST) and object (CIFAR10)… ▽ More

    Submitted 17 June, 2019; v1 submitted 27 February, 2019; originally announced May 2019.

    Comments: 14 pages, 4 figures

    Journal ref: Neural Networks, Volume 118, October 2019, Pages 90-101

  22. arXiv:1812.09414  [pdf, other

    q-bio.NC

    Mesoscopic population equations for spiking neural networks with synaptic short-term plasticity

    Authors: Valentin Schmutz, Wulfram Gerstner, Tilo Schwalger

    Abstract: Coarse-graining microscopic models of biological neural networks to obtain mesoscopic models of neural activities is an essential step towards multi-scale models of the brain. Here, we extend a recent theory for mesoscopic population dynamics with static synapses to the case of dynamic synapses exhibiting short-term plasticity (STP). Under the assumption that spike arrivals at synapses have Poisso… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    MSC Class: 60G55; 92B20

  23. arXiv:1812.06925  [pdf, other

    cond-mat.dis-nn q-bio.NC

    How single neuron properties shape chaotic dynamics and signal transmission in random neural networks

    Authors: Samuel P. Muscinelli, Wulfram Gerstner, Tilo Schwalger

    Abstract: While most models of randomly connected networks assume nodes with simple dynamics, nodes in realistic highly connected networks, such as neurons in the brain, exhibit intrinsic dynamics over multiple timescales. We analyze how the dynamical properties of nodes (such as single neurons) and recurrent connections interact to shape the effective dynamics in large randomly connected networks. A novel… ▽ More

    Submitted 28 February, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

  24. arXiv:1812.06669  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Learning to Generate Music with BachProp

    Authors: Florian Colombo, Johanni Brea, Wulfram Gerstner

    Abstract: As deep learning advances, algorithms of music composition increase in performance. However, most of the successful models are designed for specific musical structures. Here, we present BachProp, an algorithmic composer that can generate music scores in many styles given sufficient training data. To adapt BachProp to a broad range of musical styles, we propose a novel representation of music and t… ▽ More

    Submitted 12 June, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

    Journal ref: in Proceedings of the 16th Sound and Music Computing Conference. 2019. p. 380-386

  25. arXiv:1805.11851  [pdf, other

    q-bio.NC

    On the choice of metric in gradient-based theories of brain function

    Authors: Simone Carlo Surace, Jean-Pascal Pfister, Wulfram Gerstner, Johanni Brea

    Abstract: The idea that the brain functions so as to minimize certain costs pervades theoretical neuroscience. Since a cost function by itself does not predict how the brain finds its minima, additional assumptions about the optimization method need to be made to predict the dynamics of physiological quantities. In this context, steepest descent (also called gradient descent) is often suggested as an algori… ▽ More

    Submitted 21 December, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Revised version; 14 pages, 4 figures

  26. Optimal stimulation protocol in a bistable synaptic consolidation model

    Authors: Chiara Gastaldi, Samuel P. Muscinelli, Wulfram Gerstner

    Abstract: Consolidation of synaptic changes in response to neural activity is thought to be fundamental for memory maintenance over a timescale of hours. In experiments, synaptic consolidation can be induced by repeatedly stimulating presynaptic neurons. However, the effectiveness of such protocols depends crucially on the repetition frequency of the stimulations and the mechanisms that cause this complex d… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 23 pages, 6 figures

    Journal ref: Front. Comput. Neurosci., 13 November 2019

  27. arXiv:1802.05162  [pdf, other

    cs.SD eess.AS

    BachProp: Learning to Compose Music in Multiple Styles

    Authors: Florian Colombo, Wulfram Gerstner

    Abstract: Hand in hand with deep learning advancements, algorithms of music composition increase in performance. However, most of the successful models are designed for specific musical structures. Here, we present BachProp, an algorithmic composer that can generate music scores in any style given sufficient training data. To adapt BachProp to a broad range of musical styles, we propose a novel normalized r… ▽ More

    Submitted 20 February, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: Preliminary work. Under review by the 2018 International Conference on Machine Learning (ICML)

  28. arXiv:1802.04325  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation

    Authors: Dane Corneil, Wulfram Gerstner, Johanni Brea

    Abstract: Modern reinforcement learning algorithms reach super-human performance on many board and video games, but they are sample inefficient, i.e. they typically require significantly more playing experience than humans to reach an equal performance level. To improve sample efficiency, an agent may build a model of the environment and use planning methods to update its policy. In this article we introduc… ▽ More

    Submitted 11 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Accepted at ICML 2018; camera-ready version

  29. Eligibility Traces and Plasticity on Behavioral Time Scales: Experimental Support of neoHebbian Three-Factor Learning Rules

    Authors: Wulfram Gerstner, Marco Lehmann, Vasiliki Liakoni, Dane Corneil, Johanni Brea

    Abstract: Most elementary behaviors such as moving the arm to grasp an object or walking into the next room to explore a museum evolve on the time scale of seconds; in contrast, neuronal action potentials occur on the time scale of a few milliseconds. Learning rules of the brain must therefore bridge the gap between these two different time scales. Modern theories of synaptic plasticity have postulated th… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  30. arXiv:1712.10158  [pdf, other

    q-bio.NC cs.LG cs.NE eess.SY stat.ML

    Non-linear motor control by local learning in spiking neural networks

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Learning weights in a spiking neural network with hidden neurons, using local, stable and online rules, to control non-linear body dynamics is an open problem. Here, we employ a supervised scheme, Feedback-based Online Local Learning Of Weights (FOLLOW), to train a network of heterogeneous spiking neurons with hidden layers, to control a two-link arm so as to reproduce a desired state trajectory.… ▽ More

    Submitted 29 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:1773-1782, 2018

  31. arXiv:1712.10062  [pdf, other

    q-bio.NC cs.LG cs.NE stat.ML

    Multi-timescale memory dynamics in a reinforcement learning network with attention-gated memory

    Authors: Marco Martinolli, Wulfram Gerstner, Aditya Gilra

    Abstract: Learning and memory are intertwined in our brain and their relationship is at the core of several recent neural network models. In particular, the Attention-Gated MEmory Tagging model (AuGMEnT) is a reinforcement learning network with an emphasis on biological plausibility of memory dynamics and learning. We find that the AuGMEnT network does not solve some hierarchical tasks, where higher-level s… ▽ More

    Submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Frontiers in Computational Neuroscience, 12 July 2018 | https://doi.org/10.3389/fncom.2018.00050

  32. arXiv:1711.08032  [pdf, other

    q-bio.NC

    Efficient low-dimensional approximation of continuous attractor networks

    Authors: Alexander Seeholzer, Moritz Deger, Wulfram Gerstner

    Abstract: Continuous "bump" attractors are an established model of cortical working memory for continuous variables and can be implemented using various neuron and network models. Here, we develop a generalizable approach for the approximation of bump states of continuous attractor networks implemented in networks of both rate-based and spiking neurons. The method relies on a low-dimensional parametrization… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 23 pages, 6 figures, 3 tables. A previous version of this article was published as a thesis chapter of the first author

  33. One-shot learning and behavioral eligibility traces in sequential decision making

    Authors: Marco Lehmann, He Xu, Vasiliki Liakoni, Michael Herzog, Wulfram Gerstner, Kerstin Preuschoff

    Abstract: In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning theory suggests two classes of algorithms solving this credit assignment problem: In classic temporal-difference learning, earlier actions receive reward information only after multiple repetit… ▽ More

    Submitted 12 November, 2019; v1 submitted 13 July, 2017; originally announced July 2017.

    Journal ref: eLife 2019; 8:e47463

  34. arXiv:1702.06463  [pdf, other

    q-bio.NC cs.LG cs.NE eess.SY

    Predicting non-linear dynamics by stable local learning in a recurrent spiking neural network

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Brains need to predict how the body reacts to motor commands. It is an open question how networks of spiking neurons can learn to reproduce the non-linear body dynamics caused by motor commands, using local, online and stable learning rules. Here, we present a supervised learning scheme for the feedforward and recurrent connections in a network of heterogeneous spiking neurons. The error in the ou… ▽ More

    Submitted 26 April, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Journal ref: eLife 2017;6:e28295

  35. arXiv:1612.03214  [pdf, other

    cs.LG cs.NE q-bio.NC

    Towards deep learning with spiking neurons in energy based models with contrastive Hebbian plasticity

    Authors: Thomas Mesnard, Wulfram Gerstner, Johanni Brea

    Abstract: In machine learning, error back-propagation in multi-layer neural networks (deep learning) has been impressively successful in supervised and reinforcement learning tasks. As a model for learning in the brain, however, deep learning has long been regarded as implausible, since it relies in its basic form on a non-local plasticity rule. To overcome this problem, energy-based models with local contr… ▽ More

    Submitted 9 December, 2016; originally announced December 2016.

  36. Towards a theory of cortical columns: From spiking neurons to interacting neural populations of finite size

    Authors: Tilo Schwalger, Moritz Deger, Wulfram Gerstner

    Abstract: Neural population equations such as neural mass or field models are widely used to study brain activity on a large scale. However, the relation of these models to the properties of single neurons is unclear. Here we derive an equation for several interacting populations at the mesoscopic scale starting from a microscopic model of randomly connected generalized integrate-and-fire neuron models. Eac… ▽ More

    Submitted 21 April, 2017; v1 submitted 1 November, 2016; originally announced November 2016.

    Comments: Simulation code available from https://github.com/schwalger/mesopopdyn_gif

    Journal ref: PLoS Comput. Biol., 13(4):e1005507, 2017

  37. Multi-contact synapses for stable networks: a spike-timing dependent model of dendritic spine plasticity and turnover

    Authors: Moritz Deger, Alexander Seeholzer, Wulfram Gerstner

    Abstract: Excitatory synaptic connections in the adult neocortex consist of multiple synaptic contacts, almost exclusively formed on dendritic spines. Changes of dendritic spine shape and volume, a correlate of synaptic strength, can be tracked in vivo for weeks. Here, we present a combined model of spike-timing dependent dendritic spine plasticity and turnover that explains the steady state multi-contact c… ▽ More

    Submitted 19 September, 2016; originally announced September 2016.

    Comments: 28 pages, 9 figures

    Journal ref: Cerebral Cortex 28-4 (2018) 1396-1415

  38. Algorithmic Composition of Melodies with Deep Recurrent Neural Networks

    Authors: Florian Colombo, Samuel P. Muscinelli, Alexander Seeholzer, Johanni Brea, Wulfram Gerstner

    Abstract: A big challenge in algorithmic composition is to devise a model that is both easily trainable and able to reproduce the long-range temporal dependencies typical of music. Here we investigate how artificial neural networks can be trained on a large corpus of melodies and turned into automated music composers able to generate new melodies coherent with the style they have been trained on. We employ… ▽ More

    Submitted 23 June, 2016; originally announced June 2016.

    Comments: Proceeding of the 1st Conference on Computer Simulation of Musical Creativity, Huddersfield University

  39. arXiv:1606.05642  [pdf, other

    stat.ML cs.LG q-bio.NC

    Balancing New Against Old Information: The Role of Surprise in Learning

    Authors: Mohammadjavad Faraji, Kerstin Preuschoff, Wulfram Gerstner

    Abstract: Surprise describes a range of phenomena from unexpected events to behavioral responses. We propose a measure of surprise and use it for surprise-driven learning. Our surprise measure takes into account data likelihood as well as the degree of commitment to a belief via the entropy of the belief distribution. We find that surprise-minimizing learning dynamically adjusts the balance between new and… ▽ More

    Submitted 1 March, 2017; v1 submitted 17 June, 2016; originally announced June 2016.

  40. arXiv:1604.00087  [pdf, other

    q-bio.NC

    Automated point-neuron simplification of data-driven microcircuit models

    Authors: Christian Rössert, Christian Pozzorini, Giuseppe Chindemi, Andrew P. Davison, Csaba Eroe, James King, Taylor H. Newton, Max Nolte, Srikanth Ramaswamy, Michael W. Reimann, Willem Wybo, Marc-Oliver Gewaltig, Wulfram Gerstner, Henry Markram, Idan Segev, Eilif Muller

    Abstract: A method is presented for the reduction of morphologically detailed microcircuit models to a point-neuron representation without human intervention. The simplification occurs in a modular workflow, in the neighborhood of a user specified network activity state for the reference model, the "operating point". First, synapses are moved to the soma, correcting for dendritic filtering by low-pass filte… ▽ More

    Submitted 30 March, 2017; v1 submitted 31 March, 2016; originally announced April 2016.

    Comments: Changes since version 1: filter fitting approach replaced by a new method to directly extract the filters using a Green's functions approach. Methods Section 2.1, Results Section 3.1 and Figures 1, 2, 4, 5 have been updated to reflect these changes. Discussion has been updated to incorporate the new findings

  41. Nonlinear Hebbian learning as a unifying principle in receptive field formation

    Authors: Carlos S. N. Brito, Wulfram Gerstner

    Abstract: The development of sensory receptive fields has been modeled in the past by a variety of models including normative models such as sparse coding or independent component analysis and bottom-up models such as spike-timing dependent plasticity or the Bienenstock-Cooper-Munro model of synaptic plasticity. Here we show that the above variety of approaches can all be unified into a single common princi… ▽ More

    Submitted 4 January, 2016; originally announced January 2016.

  42. Fluctuations and information filtering in coupled populations of spiking neurons with adaptation

    Authors: Moritz Deger, Tilo Schwalger, Richard Naud, Wulfram Gerstner

    Abstract: Finite-sized populations of spiking elements are fundamental to brain function, but also used in many areas of physics. Here we present a theory of the dynamics of finite-sized populations of spiking units, based on a quasi-renewal description of neurons with adaptation. We derive an integral equation with colored noise that governs the stochastic dynamics of the population activity in response to… ▽ More

    Submitted 3 March, 2015; v1 submitted 17 November, 2013; originally announced November 2013.

    Journal ref: Phys. Rev. E 90, 062704 (2014)

  43. arXiv:1311.3586  [pdf, other

    q-bio.NC

    Spike timing prediction with active dendrites

    Authors: Richard Naud, Brice Bathellier, Wulfram Gerstner

    Abstract: A complete single-neuron model must correctly reproduce the firing of spikes and bursts. We present a study of a simplified model of deep pyramidal cells of the cortex with active dendrites. We hypothesized that we can model the soma and its apical tuft with only two compartments, without significant loss in the accuracy of spike-timing predictions. The model is based on experimentally measurable… ▽ More

    Submitted 2 December, 2013; v1 submitted 14 November, 2013; originally announced November 2013.

    Comments: 7 pages, 4 figures

  44. arXiv:1303.6708  [pdf, other

    q-bio.NC

    Reward-based learning under hardware constraints - Using a RISC processor embedded in a neuromorphic substrate

    Authors: Simon Friedmann, Nicolas Frémaux, Johannes Schemmel, Wulfram Gerstner, Karlheinz Meier

    Abstract: In this study, we propose and analyze in simulations a new, highly flexible method of implementing synaptic plasticity in a wafer-scale, accelerated neuromorphic hardware system. The study focuses on globally modulated STDP, as a special use-case of this method. Flexibility is achieved by embedding a general-purpose processor dedicated to plasticity into the wafer. To evaluate the suitability of t… ▽ More

    Submitted 20 August, 2013; v1 submitted 26 March, 2013; originally announced March 2013.

    Comments: 37 pages, 11 figures, to be published in Frontiers in Neuromorphic Engineering. This version contains major additions to the result and discussion parts

  45. Nonnormal amplification in random balanced neuronal networks

    Authors: Guillaume Hennequin, Tim P. Vogels, Wulfram Gerstner

    Abstract: In dynamical models of cortical networks, the recurrent connectivity can amplify the input given to the network in two distinct ways. One is induced by the presence of near-critical eigenvalues in the connectivity matrix W, producing large but slow activity fluctuations along the corresponding eigenvectors (dynamical slowing). The other relies on W being nonnormal, which allows the network activit… ▽ More

    Submitted 13 April, 2012; originally announced April 2012.

    Comments: 13 pages, 7 figures

    Journal ref: Physical Review E (2012) 86:011909

  46. arXiv:1011.4188  [pdf, ps, other

    q-bio.NC

    Rescaling, thinning or complementing? On goodness-of-fit procedures for point process models and Generalized Linear Models

    Authors: Felipe Gerhard, Wulfram Gerstner

    Abstract: Generalized Linear Models (GLMs) are an increasingly popular framework for modeling neural spike trains. They have been linked to the theory of stochastic point processes and researchers have used this relation to assess goodness-of-fit using methods from point-process theory, e.g. the time-rescaling theorem. However, high neural firing rates or coarse discretization lead to a breakdown of the ass… ▽ More

    Submitted 18 November, 2010; originally announced November 2010.

    Comments: 9 pages, to appear in NIPS 2010 (Neural Information Processing Systems), corrected missing reference

  47. arXiv:q-bio/0505008  [pdf, ps, other

    q-bio.NC

    Noise-enhanced computation in a model of a cortical column

    Authors: Julien Mayor, Wulfram Gerstner

    Abstract: Varied sensory systems use noise in order to enhance detection of weak signals. It has been conjectured in the literature that this effect, known as stochastic resonance, may take place in central cognitive processes such as the memory retrieval of arithmetical multiplication. We show in a simplified model of cortical tissue, that complex arithmetical calculations can be carried out and are enha… ▽ More

    Submitted 4 May, 2005; originally announced May 2005.

  48. arXiv:q-bio/0502037  [pdf, ps, other

    q-bio.NC

    Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing

    Authors: Jean-Pascal Pfister, Taro Toyoizumi, David Barber, Wulfram Gerstner

    Abstract: In timing-based neural codes, neurons have to emit action potentials at precise moments in time. We use a supervised learning paradigm to derive a synaptic update rule that optimizes via gradient ascent the likelihood of postsynaptic firing at one or several desired firing times. We find that the optimal strategy of up- and downregulating synaptic efficacies can be described by a two-phase learn… ▽ More

    Submitted 24 February, 2005; originally announced February 2005.

    Comments: 27 pages, 10 figures

  49. Signal buffering in random networks of spiking neurons: microscopic vs. macroscopic phenomena

    Authors: Julien Mayor, Wulfram Gerstner

    Abstract: In randomly connected networks of pulse-coupled elements a time-dependent input signal can be buffered over a short time. We studied the signal buffering properties in simulated networks as a function of the networks state, characterized by both the Lyapunov exponent of the microscopic dynamics and the macroscopic activity derived from mean-field theory. If all network elements receive the same… ▽ More

    Submitted 23 February, 2005; originally announced February 2005.

    Comments: 5 pages, 3 figures

  50. arXiv:q-bio/0502030  [pdf, ps, other

    q-bio.NC

    Transient Information Flow in a Network of Excitatory and Inhibitory Model Neurons: Role of Noise and Signal Autocorrelation

    Authors: Julien Mayor, Wulfram Gerstner

    Abstract: We investigate the performance of sparsely-connected networks of integrate-and-fire neurons for ultra-short term information processing. We exploit the fact that the population activity of networks with balanced excitation and inhibition can switch from an oscillatory firing regime to a state of asynchronous irregular firing or quiescence depending on the rate of external background spikes. We… ▽ More

    Submitted 23 February, 2005; originally announced February 2005.

    Comments: 27 pages, 7 figures, to appear in J. Physiology (Paris) Vol. 98