The thermodynamic limit in mean field neural networks
Authors:
Elena Agliari,
Adriano Barra,
Pierluigi Bianco,
Alberto Fachechi,
Diego Pallara
Abstract:
In the last five decades, mean-field neural-networks have played a crucial role in modelling associative memories and, in particular, the Hopfield model has been extensively studied using tools borrowed from the statistical mechanics of spin glasses. However, achieving mathematical control of the infinite-volume limit of the model's free-energy has remained elusive, as the standard treatments deve…
▽ More
In the last five decades, mean-field neural-networks have played a crucial role in modelling associative memories and, in particular, the Hopfield model has been extensively studied using tools borrowed from the statistical mechanics of spin glasses. However, achieving mathematical control of the infinite-volume limit of the model's free-energy has remained elusive, as the standard treatments developed for spin-glasses have proven unfeasible. Here we address this long-standing problem by proving that a measure-concentration assumption for the order parameters of the theory is sufficient for the existence of the asymptotic limit of the model's free energy. The proof leverages the equivalence between the free energy of the Hopfield model and a linear combination of the free energies of a hard and a soft spin-glass, whose thermodynamic limits are rigorously known. Our work focuses on the replica-symmetry level of description (for which we recover the explicit expression of the free-energy found in the eighties via heuristic methods), yet, our scheme is expected to work also under (at least) the first step of replica symmetry breaking.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
Hebbian Learning from First Principles
Authors:
Linda Albanese,
Adriano Barra,
Pierluigi Bianco,
Fabrizio Durante,
Diego Pallara
Abstract:
Recently, the original storage prescription for the Hopfield model of neural networks -- as well as for its dense generalizations -- has been turned into a genuine Hebbian learning rule by postulating the expression of its Hamiltonian for both the supervised and unsupervised protocols. In these notes, first, we obtain these explicit expressions by relying upon maximum entropy extremization à la Ja…
▽ More
Recently, the original storage prescription for the Hopfield model of neural networks -- as well as for its dense generalizations -- has been turned into a genuine Hebbian learning rule by postulating the expression of its Hamiltonian for both the supervised and unsupervised protocols. In these notes, first, we obtain these explicit expressions by relying upon maximum entropy extremization à la Jaynes. Beyond providing a formal derivation of these recipes for Hebbian learning, this construction also highlights how Lagrangian constraints within entropy extremization force network's outcomes on neural correlations: these try to mimic the empirical counterparts hidden in the datasets provided to the network for its training and, the denser the network, the longer the correlations that it is able to capture. Next, we prove that, in the big data limit, whatever the presence of a teacher (or its lacking), not only these Hebbian learning rules converge to the original storage prescription of the Hopfield model but also their related free energies (and, thus, the statistical mechanical picture provided by Amit, Gutfreund and Sompolinsky is fully recovered). As a sideline, we show mathematical equivalence among standard Cost functions (Hamiltonian), preferred in Statistical Mechanical jargon, and quadratic Loss Functions, preferred in Machine Learning terminology. Remarks on the exponential Hopfield model (as the limit of dense networks with diverging density) and semi-supervised protocols are also provided.
△ Less
Submitted 3 October, 2024; v1 submitted 13 January, 2024;
originally announced January 2024.