-
Amorphous Order & Non-linear Susceptibilities in Glassy Materials
Authors:
Giulio Biroli,
Jean-Philippe Bouchaud,
Francois Ladieu
Abstract:
We review 15 years of theoretical and experimental work on the non-linear response of glassy systems. We argue that an anomalous growth of the peak value of non-linear susceptibilities is a signature of growing "amorphous order" in the system, with spin-glasses as a case in point. Experimental results on supercooled liquids are fully compatible with the RFOT prediction of compact "glassites" of increasing volume as temperature is decreased, or as the system ages. We clarify why such a behaviour is hard to explain within purely kinetic theories of glass formation, despite recent claims to the contrary.
Submitted 11 January, 2021;
originally announced January 2021.
-
The Lévy-Rosenzweig-Porter random matrix ensemble
Authors:
Giulio Biroli,
Marco Tarzia
Abstract:
In this paper we consider an extension of the Rosenzweig-Porter (RP) model, the Lévy-RP (L-RP) model, in which the off-diagonal matrix elements are broadly distributed, providing a more realistic benchmark to develop an effective description of non-ergodic extended (NEE) states in interacting many-body disordered systems. We put forward a simple, general, and intuitive argument that allows one to unveil the multifractal structure of the mini-bands in the local spectrum when hybridization is due to anomalously large transition amplitudes in the tails of the distribution. The idea is that the energy spreading of the mini-bands can be determined self-consistently by requiring that the maximum of the matrix elements between a site $i$ and the other $N^{D_1}$ sites of the support set is of the same order as the Thouless energy itself, $N^{D_1 - 1}$. This argument yields the fractal dimensions that characterize the statistics of the multifractal wave-functions in the NEE phase, as well as the whole phase diagram of the L-RP ensemble. Its predictions are confirmed both analytically, by a thorough investigation of the self-consistent equation for the local density of states obtained using the cavity approach, and numerically, via extensive exact diagonalizations.
Submitted 17 January, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.
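For readers unfamiliar with the construction that the abstract above extends, here is a minimal sketch of the standard Gaussian Rosenzweig-Porter ensemble. It is not the Lévy variant studied in the paper. It assumes the common convention of random diagonal entries of order one plus a GOE-like matrix whose off-diagonal entries are scaled by $N^{-\gamma/2}$. The function names, the matrix size, and the values of `gamma` are illustrative choices and do not come from the listing. The mean inverse participation ratio (IPR) of the eigenvectors is used as a crude indicator: close to 1 for localized states, of order $1/N$ for fully extended ones.

```python
import numpy as np

def rosenzweig_porter(n, gamma, rng):
    """Sample an n x n Gaussian Rosenzweig-Porter matrix (assumed convention)."""
    a = rng.normal(size=(n, n))
    goe = (a + a.T) / np.sqrt(2.0)          # symmetric, off-diagonal variance 1
    h = goe * n ** (-gamma / 2.0)           # off-diagonal variance n**(-gamma)
    np.fill_diagonal(h, rng.normal(size=n)) # diagonal disorder of order one
    return h

def mean_ipr(h):
    """Average inverse participation ratio over all eigenvectors of h."""
    _, vecs = np.linalg.eigh(h)             # columns are normalized eigenvectors
    return float(np.mean(np.sum(vecs ** 4, axis=0)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    for gamma in (0.5, 1.5, 3.0):
        print(gamma, mean_ipr(rosenzweig_porter(200, gamma, rng)))
```

At a fixed size the IPR only separates the weak and strong off-diagonal regimes. Resolving the intermediate non-ergodic extended phase would require a finite-size scaling analysis, which this sketch does not attempt.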
-
Searching for the Gardner transition in glassy glycerol
Authors:
Samuel Albert,
Giulio Biroli,
François Ladieu,
Roland Tourbot,
Pierfrancesco Urbani
Abstract:
We search for a Gardner transition in glassy glycerol, a standard molecular glass, measuring the third-harmonic cubic susceptibility $\chi_3^{(3)}$ from slightly below the usual glass transition temperature down to $10\,\mathrm{K}$. According to the mean-field picture, if local motion within the glass were becoming highly correlated due to the emergence of a Gardner phase, then $\chi_3^{(3)}$, which is analogous to the dynamical spin-glass susceptibility, should increase and diverge at the Gardner transition temperature $T_G$. We find instead that upon cooling $|\chi_3^{(3)}|$ decreases by several orders of magnitude and becomes roughly constant in the regime from $100\,\mathrm{K}$ to $10\,\mathrm{K}$. We rationalize our findings by assuming that the low temperature physics is described by localized excitations weakly interacting via a spin-glass dipolar pairwise interaction in a random magnetic field. Our quantitative estimations show that the spin-glass interaction is twenty to fifty times smaller than the local random field contribution, thus rationalizing the absence of the spin-glass Gardner phase. This hints at the fact that a Gardner phase may be suppressed in standard molecular glasses, but it also suggests ways to favor its existence in other amorphous solids and by changing the preparation protocol.
Submitted 8 December, 2020; v1 submitted 7 October, 2020;
originally announced October 2020.
-
Properties of equilibria and glassy phases of the random Lotka-Volterra model with demographic noise
Authors:
Ada Altieri,
Felix Roy,
Chiara Cammarota,
Giulio Biroli
Abstract:
In this letter we study a reference model in theoretical ecology, the disordered Lotka-Volterra model for ecological communities, in the presence of finite demographic noise. Our theoretical analysis, which takes advantage of a mapping to an equilibrium disordered system, proves that for sufficiently heterogeneous interactions and low demographic noise the system displays a multiple equilibria phase, which we fully characterize. In particular, we show that in this phase the number of stable equilibria is exponential in the number of species. Upon further decreasing the demographic noise, we unveil a "Gardner" transition to a marginally stable phase, similar to that observed in jamming of amorphous materials. We confirm and complement our analytical results by numerical simulations. Furthermore, we extend their relevance by showing that they hold for other interacting random dynamical systems, such as the Random Replicant Model. Finally, we discuss their extension to the case of asymmetric couplings.
Submitted 22 September, 2020;
originally announced September 2020.
-
Out of equilibrium Phase Diagram of the Quantum Random Energy Model
Authors:
Giulio Biroli,
Davide Facoetti,
Marco Schiró,
Marco Tarzia,
Pierpaolo Vivo
Abstract:
In this paper we study the out-of-equilibrium phase diagram of the quantum version of Derrida's Random Energy Model, which is the simplest model of mean-field spin glasses. We interpret its corresponding quantum dynamics in Fock space as a one-particle problem in very high dimension to which we apply different theoretical methods tailored for high-dimensional lattices: the Forward-Scattering Approximation, a mapping to the Rosenzweig-Porter model, and the cavity method. Our results indicate the existence of two transition lines and three distinct dynamical phases: a completely many-body localized phase at low energy, a fully ergodic phase at high energy, and a multifractal "bad metal" phase at intermediate energy. In the latter, eigenfunctions occupy a diverging volume, yet an exponentially vanishing fraction of the total Hilbert space. We discuss the limitations of our approximations and the relationship with previous studies.
Submitted 11 March, 2021; v1 submitted 21 September, 2020;
originally announced September 2020.
-
An analytic theory of shallow networks dynamics for hinge loss classification
Authors:
Franco Pellegrini,
Giulio Biroli
Abstract:
Neural networks have been shown to perform incredibly well in classification tasks over structured high-dimensional datasets. However, the learning dynamics of such networks is still poorly understood. In this paper we study in detail the training dynamics of a simple type of neural network: a single hidden layer trained to perform a classification task. We show that in a suitable mean-field limit this case maps to a single-node learning problem with a time-dependent dataset determined self-consistently from the average node population. We specialize our theory to the prototypical case of a linearly separable dataset and a linear hinge loss, for which the dynamics can be explicitly solved. This allows us to address in a simple setting several phenomena appearing in modern networks such as slowing down of training dynamics, crossover between rich and lazy learning, and overfitting. Finally, we assess the limitations of mean-field theory by studying the case of a large but finite number of nodes and of training samples.
Submitted 19 June, 2020;
originally announced June 2020.
-
Glasses and aging: A Statistical Mechanics Perspective
Authors:
Francesco Arceri,
François P. Landes,
Ludovic Berthier,
Giulio Biroli
Abstract:
We review the field of the glass transition, glassy dynamics and aging from a statistical mechanics perspective. We give a brief introduction to the subject and explain the main phenomenology encountered in glassy systems, with a particular emphasis on spatially heterogeneous dynamics. We review the main theoretical approaches currently available to account for these glassy phenomena, including recent developments regarding mean-field theory of liquids and glasses, novel computational tools, and connections to the jamming transition. Finally, the physics of aging and off-equilibrium dynamics exhibited by glassy materials is discussed.
Submitted 8 October, 2020; v1 submitted 17 June, 2020;
originally announced June 2020.
-
Dynamical Instantons and Activated Processes in Mean-Field Glass Models
Authors:
V. Ros,
G. Biroli,
C. Cammarota
Abstract:
We focus on the energy landscape of a simple mean-field model of glasses and analyze activated barrier-crossing by combining the Kac-Rice method for high-dimensional Gaussian landscapes with dynamical field theory. In particular, we consider Langevin dynamics at low temperature in the energy landscape of the pure spherical $p$-spin model. We select as initial condition for the dynamics one of the many unstable index-1 saddles in the vicinity of a reference local minimum. We show that the associated dynamical mean-field equations admit two solutions: one corresponds to falling back to the original reference minimum, and the other to reaching a new minimum past the barrier. By varying the saddle we scan and characterize the properties of such minima reachable by activated barrier-crossing. Finally, using time-reversal transformations, we construct the two-point function dynamical instanton of the corresponding activated process.
Submitted 15 December, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval
Authors:
Stefano Sarao Mannelli,
Giulio Biroli,
Chiara Cammarota,
Florent Krzakala,
Pierfrancesco Urbani,
Lenka Zdeborová
Abstract:
Despite the widespread use of gradient-based algorithms for optimizing high-dimensional non-convex functions, understanding their ability to find good minima instead of being trapped in spurious ones remains to a large extent an open problem. Here we focus on gradient flow dynamics for phase retrieval from random measurements. When the ratio of the number of measurements over the input dimension is small, the dynamics remains trapped in spurious minima with large basins of attraction. We find analytically that above a critical ratio those critical points become unstable, developing a negative direction toward the signal. By numerical experiments we show that in this regime the gradient flow algorithm is not trapped; it drifts away from the spurious critical points along the unstable direction and succeeds in finding the global minimum. Using tools from statistical physics we characterize this phenomenon, which is related to a BBP-type transition in the Hessian of the spurious minima.
Submitted 12 June, 2020;
originally announced June 2020.
-
Triple descent and the two kinds of overfitting: Where & why do they appear?
Authors:
Stéphane d'Ascoli,
Levent Sagun,
Giulio Biroli
Abstract:
A recent line of research has highlighted the existence of a "double descent" phenomenon in deep learning, whereby increasing the number of training examples $N$ causes the generalization error of neural networks to peak when $N$ is of the same order as the number of parameters $P$. In earlier works, a similar phenomenon was shown to exist in simpler models such as linear regression, where the peak instead occurs when $N$ is equal to the input dimension $D$. Since both peaks coincide with the interpolation threshold, they are often conflated in the literature. In this paper, we show that despite their apparent similarity, these two scenarios are inherently different. In fact, both peaks can co-exist when neural networks are applied to noisy regression tasks. The relative size of the peaks is then governed by the degree of nonlinearity of the activation function. Building on recent developments in the analysis of random feature models, we provide a theoretical ground for this sample-wise triple descent. As shown previously, the nonlinear peak at $N\!=\!P$ is a true divergence caused by the extreme sensitivity of the output function to both the noise corrupting the labels and the initialization of the random features (or the weights in neural networks). This peak survives in the absence of noise, but can be suppressed by regularization. In contrast, the linear peak at $N\!=\!D$ is solely due to overfitting the noise in the labels, and forms earlier during training. We show that this peak is implicitly regularized by the nonlinearity, which is why it only becomes salient at high noise and is weakly affected by explicit regularization. Throughout the paper, we compare analytical results obtained in the random feature model with the outcomes of numerical experiments involving deep neural networks.
Submitted 13 October, 2020; v1 submitted 5 June, 2020;
originally announced June 2020.
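The linear peak at $N = D$ described in the abstract above can be reproduced in a toy setting. The following is a minimal sketch, not the random feature model analysed in the paper: it assumes isotropic Gaussian inputs, a linear teacher with additive Gaussian label noise, and the minimum-norm least-squares estimator computed with `numpy.linalg.pinv`. The function name `excess_risk`, the dimension, the noise level, and the number of trials are illustrative assumptions. For isotropic Gaussian test inputs, the excess test error equals the squared distance between the estimated and true weight vectors, which is what the function returns.

```python
import numpy as np

def excess_risk(n_train, dim, noise, rng, n_trials=50):
    """Mean squared weight error of min-norm least squares, averaged over trials."""
    errs = []
    for _ in range(n_trials):
        w = rng.normal(size=dim) / np.sqrt(dim)       # teacher with norm of order one
        x = rng.normal(size=(n_train, dim))           # isotropic Gaussian inputs
        y = x @ w + noise * rng.normal(size=n_train)  # noisy linear labels
        w_hat = np.linalg.pinv(x) @ y                 # minimum-norm least-squares fit
        errs.append(np.sum((w_hat - w) ** 2))
    return float(np.mean(errs))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dim = 20
    for n_train in (5, 10, 20, 40, 80):
        print(n_train, excess_risk(n_train, dim, noise=0.5, rng=rng))
```

In this sketch the error is expected to be largest near `n_train == dim`, where the design matrix is square and typically ill-conditioned, so the label noise is strongly amplified.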
-
Dynamical Mean-Field Theory and Aging Dynamics
Authors:
Ada Altieri,
Giulio Biroli,
Chiara Cammarota
Abstract:
Dynamical Mean-Field Theory (DMFT) replaces the many-body dynamical problem with one for a single degree of freedom in a thermal bath whose features are determined self-consistently. By focusing on models with soft disordered $p$-spin interactions, we show how to incorporate the mean-field theory of aging within dynamical mean-field theory. We study cases with only one slow time-scale, corresponding statically to the one-step replica symmetry breaking (1RSB) phase, and cases with an infinite number of slow time-scales, corresponding statically to the full replica symmetry breaking (FRSB) phase. For the former, we show that the effective temperature of the slow degrees of freedom is fixed by requiring critical dynamical behavior on short time-scales, i.e. marginality. For the latter, we find that aging on an infinite number of slow time-scales is governed by a stochastic equation where the clock for dynamical evolution is fixed by the change of effective temperature, hence obtaining a dynamical derivation of the stochastic equation at the basis of the FRSB phase. Our results extend the realm of the mean-field theory of aging to all situations where DMFT holds.
Submitted 11 May, 2020;
originally announced May 2020.
-
Interplay between percolation and glassiness in the random Lorentz gas
Authors:
Giulio Biroli,
Patrick Charbonneau,
Eric I. Corwin,
Yi Hu,
Harukuni Ikeda,
Grzegorz Szamel,
Francesco Zamponi
Abstract:
The random Lorentz gas (RLG) is a minimal model of transport in heterogeneous media. It also models the dynamics of a tracer in a glassy system. These two perspectives, however, are fundamentally inconsistent. Arrest in the former is related to percolation, and hence continuous, while glass-like arrest is discontinuous. In order to clarify the interplay between percolation and glassiness in the RLG, we consider its exact solution in the infinite-dimensional $d\rightarrow\infty$ limit, as well as numerics in $d=2\ldots 20$. We find that the mean-field solutions of the RLG and glasses fall in the same universality class, and that instantonic corrections related to rare cage escapes destroy the glass transition in finite dimensions. This advance suggests that the RLG can be used as a toy model to develop a first-principles description of hopping in structural glasses.
Submitted 26 February, 2021; v1 submitted 24 March, 2020;
originally announced March 2020.
-
Anomalous dynamics in the ergodic side of the Many-Body Localization transition and the glassy phase of Directed Polymers in Random Media
Authors:
Giulio Biroli,
Marco Tarzia
Abstract:
Using the non-interacting Anderson tight-binding model on the Bethe lattice as a toy model for the many-body quantum dynamics, we propose a novel and transparent theoretical explanation of the anomalously slow dynamics that emerges in the bad metal phase preceding the Many-Body Localization transition. By mapping the time-decorrelation of many-body wave-functions onto Directed Polymers in Random Media, we show the existence of a glass transition within the extended regime separating a metallic-like phase at small disorder, where delocalization occurs on an exponential number of paths, from a bad metal-like phase at intermediate disorder, where resonances are formed on rare, specific, disorder-dependent site orbitals on very distant generations. The physical interpretation of subdiffusion and non-exponential relaxation emerging from this picture is complementary to the Griffiths one, although both scenarios rely on the presence of a heavy-tailed distribution of the escape times. We relate the dynamical evolution in the glassy phase to the depinning transition of Directed Polymers, which results in macroscopic and abrupt jumps of the preferred delocalizing paths when a parameter like the energy is varied, and produces a singular behavior of the overlap correlation function between eigenstates at different energies. By comparing the quantum dynamics on loop-less Cayley trees and Random Regular Graphs we discuss the effect of loops, showing that in the latter slow dynamics and apparent power-laws extend over a very large time-window but are eventually cut off on a time-scale that diverges at the MBL transition.
Submitted 21 March, 2020;
originally announced March 2020.
-
Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime
Authors:
Stéphane d'Ascoli,
Maria Refinetti,
Giulio Biroli,
Florent Krzakala
Abstract:
Deep neural networks can achieve remarkable generalization performances while interpolating the training data perfectly. Rather than the U-curve emblematic of the bias-variance trade-off, their test error often follows a "double descent" - a mark of the beneficial role of overparametrization. In this work, we develop a quantitative theory for this phenomenon in the so-called lazy learning regime of neural networks, by considering the problem of learning a high-dimensional function with random features regression. We obtain a precise asymptotic expression for the bias-variance decomposition of the test error, and show that the bias displays a phase transition at the interpolation threshold, beyond which it remains constant. We disentangle the variances stemming from the sampling of the dataset, from the additive noise corrupting the labels, and from the initialization of the weights. Following up on Geiger et al. 2019, we first show that the latter two contributions are the crux of the double descent: they lead to the overfitting peak at the interpolation threshold and to the decay of the test error upon overparametrization. We then quantify how they are suppressed by ensemble averaging the outputs of K independently initialized estimators. When K is sent to infinity, the test error remains constant beyond the interpolation threshold. We further compare the effects of overparametrizing, ensembling and regularizing. Finally, we present numerical experiments on classic deep learning setups to show that our results hold qualitatively in realistic lazy learning scenarios.
Submitted 3 April, 2020; v1 submitted 2 March, 2020;
originally announced March 2020.
-
Role of fluctuations in the yielding transition of two-dimensional glasses
Authors:
Misaki Ozawa,
Ludovic Berthier,
Giulio Biroli,
Gilles Tarjus
Abstract:
We numerically study yielding in two-dimensional glasses which are generated with a very wide range of stabilities by swap Monte-Carlo simulations and then slowly deformed at zero temperature. We provide strong numerical evidence that stable glasses yield via a nonequilibrium discontinuous transition in the thermodynamic limit. A critical point separates this brittle yielding from the ductile one observed in less stable glasses. We find that two-dimensional glasses yield similarly to their three-dimensional counterparts but display larger sample-to-sample disorder-induced fluctuations, stronger finite-size effects, and rougher spatial wandering of the observed shear bands. These findings strongly constrain effective theories of yielding.
Submitted 22 May, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Landscape Complexity for the Empirical Risk of Generalized Linear Models
Authors:
Antoine Maillard,
Gérard Ben Arous,
Giulio Biroli
Abstract:
We present a method to obtain the average and the typical value of the number of critical points of the empirical risk landscape for generalized linear estimation problems and variants. This represents a substantial extension of previous applications of the Kac-Rice method since it allows one to analyze the critical points of high-dimensional non-Gaussian random functions. Under a technical hypothesis, we obtain a rigorous explicit variational formula for the annealed complexity, which is the logarithm of the average number of critical points at a fixed value of the empirical risk. This result is simplified, and extended, using the non-rigorous Kac-Rice replicated method from theoretical physics. In this way we find an explicit variational formula for the quenched complexity, which is generally different from its annealed counterpart, and allows one to obtain the number of critical points for typical instances up to exponential accuracy.
Submitted 18 January, 2023; v1 submitted 4 December, 2019;
originally announced December 2019.
-
Can endogenous fluctuations persist in high-diversity ecosystems?
Authors:
Felix Roy,
Matthieu Barbier,
Giulio Biroli,
Guy Bunin
Abstract:
When can complex ecological interactions drive an entire ecosystem into a persistent non-equilibrium state, where species abundances keep fluctuating without going to extinction? We show that high-diversity spatially-extended systems, in which conditions vary somewhat between spatial locations, can exhibit chaotic dynamics which persist for extremely long times. We develop a theoretical framework, based on dynamical mean-field theory, to quantify the conditions under which these fluctuating states exist, and predict their properties. We uncover parallels with the persistence of externally-perturbed ecosystems, such as the role of perturbation strength, synchrony and correlation time. But uniquely to endogenous fluctuations, these properties arise from the species dynamics themselves, creating feedback loops between perturbation and response. A key result is that the fluctuation amplitude and species diversity are tightly linked: in particular, fluctuations enable dramatically more species to coexist than at equilibrium in the very same system. Our findings highlight crucial differences between well-mixed and spatially-extended systems, with implications for experiments and their ability to reproduce natural dynamics. They shed light on the maintenance of biodiversity, and the strength and synchrony of fluctuations observed in natural systems.
Submitted 26 August, 2019; v1 submitted 9 August, 2019;
originally announced August 2019.
-
Who is Afraid of Big Bad Minima? Analysis of Gradient-Flow in a Spiked Matrix-Tensor Model
Authors:
Stefano Sarao Mannelli,
Giulio Biroli,
Chiara Cammarota,
Florent Krzakala,
Lenka Zdeborová
Abstract:
Gradient-based algorithms are effective for many machine learning tasks, but despite ample recent effort and some progress, it often remains unclear why they work in practice in optimising high-dimensional non-convex functions and why they find good minima instead of being trapped in spurious ones.
Here we present a quantitative theory explaining this behaviour in a spiked matrix-tensor model.
Our framework is based on the Kac-Rice analysis of stationary points and a closed-form analysis of gradient-flow originating from statistical physics. We show that there is a well defined region of parameters where the gradient-flow algorithm finds a good global minimum despite the presence of exponentially many spurious local minima.
We show that this is achieved by surfing on saddles that have strong negative direction towards the global minima, a phenomenon that is connected to a BBP-type threshold in the Hessian describing the critical points of the landscapes.
Submitted 20 January, 2020; v1 submitted 18 July, 2019;
originally announced July 2019.
-
Classical Glasses, Black Holes, and Strange Quantum Liquids
Authors:
Davide Facoetti,
Giulio Biroli,
Jorge Kurchan,
David R. Reichman
Abstract:
From the dynamics of a broad class of classical mean-field glass models one may obtain a quantum model with finite zero-temperature entropy, a quantum transition at zero temperature, and a time-reparametrization (quasi-)invariance in the dynamical equations for correlations. The low eigenvalue spectrum of the resulting quantum model is directly related to the structure and exploration of metastable states in the landscape of the original classical glass model. This mapping reveals deep connections between classical glasses and the properties of SYK-like models.
Submitted 4 December, 2019; v1 submitted 21 June, 2019;
originally announced June 2019.
-
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias
Authors:
Stéphane d'Ascoli,
Levent Sagun,
Joan Bruna,
Giulio Biroli
Abstract:
Despite the phenomenal success of deep neural networks in a broad range of learning tasks, there is a lack of theory to understand the way they work. In particular, Convolutional Neural Networks (CNNs) are known to perform much better than Fully-Connected Networks (FCNs) on spatially structured data: the architectural structure of CNNs benefits from prior knowledge on the features of the data, for instance their translation invariance. The aim of this work is to understand this fact through the lens of dynamics in the loss landscape.
We introduce a method that maps a CNN to its equivalent FCN (denoted as eFCN). Such an embedding enables the comparison of CNN and FCN training dynamics directly in the FCN space. We use this method to test a new training protocol, which consists in training a CNN, embedding it to FCN space at a certain "relax time", then resuming the training in FCN space. We observe that for all relax times, the deviation from the CNN subspace is small, and the final performance reached by the eFCN is higher than that reachable by a standard FCN of the same architecture. More surprisingly, for some intermediate relax times, the eFCN outperforms the CNN it stemmed from, by combining the prior information of the CNN and the expressivity of the FCN in a complementary way. The practical interest of our protocol is limited by the very large size of the highly sparse eFCN. However, it offers interesting insights into the persistence of architectural bias under stochastic gradient dynamics. It shows the existence of some rare basins in the FCN loss landscape associated with very good generalization. These can only be accessed thanks to the CNN prior, which helps navigate the landscape during the early stages of optimization.
Submitted 4 February, 2020; v1 submitted 16 June, 2019;
originally announced June 2019.
-
Attractive versus truncated repulsive supercooled liquids: The dynamics is encoded in the pair correlation function
Authors:
François P. Landes,
Giulio Biroli,
Olivier Dauchot,
Andrea J. Liu,
David R. Reichman
Abstract:
We compare glassy dynamics in two liquids that differ in the form of their interaction potentials. Both systems have the same repulsive interactions but one has also an attractive part in the potential. These two systems exhibit very different dynamics despite having nearly identical pair correlation functions. We demonstrate that a properly weighted integral of the pair correlation function, which amplifies the subtle differences between the two systems, correctly captures their dynamical differences. The weights are obtained from a standard machine learning algorithm.
Submitted 14 January, 2020; v1 submitted 3 June, 2019;
originally announced June 2019.
-
How to iron out rough landscapes and get optimal performances: Averaged Gradient Descent and its application to tensor PCA
Authors:
Giulio Biroli,
Chiara Cammarota,
Federico Ricci-Tersenghi
Abstract:
In many high-dimensional estimation problems the main task consists in minimizing a cost function, which is often strongly non-convex when scanned in the space of parameters to be estimated. A standard solution to flatten the corresponding rough landscape consists in summing the losses associated to different data points and obtaining a smoother empirical risk. Here we propose a complementary method that works for a single data point. The main idea is that a large amount of the roughness is uncorrelated in different parts of the landscape. One can then substantially reduce the noise by evaluating an empirical average of the gradient obtained as a sum over many random independent positions in the space of parameters to be optimized. We present an algorithm, called Averaged Gradient Descent, based on this idea and we apply it to tensor PCA, which is a very hard estimation problem. We show that Averaged Gradient Descent outperforms physical algorithms such as gradient descent and approximate message passing and matches the best algorithmic thresholds known so far, obtained by tensor unfolding and methods based on sum-of-squares.
Submitted 6 February, 2020; v1 submitted 29 May, 2019;
originally announced May 2019.
-
Maximum-energy records in glassy energy landscapes
Authors:
Ivailo Hartarsky,
Marco Baity-Jesi,
Riccardo Ravasio,
Alain Billoire,
Giulio Biroli
Abstract:
We study the evolution of the maximum energy $E_\max(t)$ reached between time $0$ and time $t$ in the dynamics of simple models with glassy energy landscapes, in instant quenches from infinite temperature to a target temperature $T$. Through a detailed description of the activated dynamics, we are able to describe the evolution of $E_\max(t)$ from short times, through the aging regime, until after equilibrium is reached, thus providing a detailed description of the long-time dynamics. Finally, we compare our findings with numerical simulations of the $p$-spin glass and show how the maximum energy record can be used to identify the threshold energy in this model.
Submitted 13 June, 2019; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Large deviations for the largest eigenvalues and eigenvectors of spiked random matrices
Authors:
Giulio Biroli,
Alice Guionnet
Abstract:
We consider matrices formed by a random $N\times N$ matrix drawn from the Gaussian Orthogonal Ensemble (or Gaussian Unitary Ensemble) plus a rank-one perturbation of strength $θ$, and focus on the largest eigenvalue, $x$, and the component, $u$, of the corresponding eigenvector in the direction associated to the rank-one perturbation. We obtain the large deviation principle governing the atypical joint fluctuations of $x$ and $u$. Interestingly, for $θ>1$, in large deviations characterized by a small value of $u$, i.e. $u<1-1/θ$, the second-largest eigenvalue pops out from the Wigner semi-circle and the associated eigenvector orients in the direction corresponding to the rank-one perturbation. We generalize these results to the Wishart Ensemble, and we extend them to the first $n$ eigenvalues and the associated eigenvectors.
Submitted 3 April, 2019;
originally announced April 2019.
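As a hedged illustration of the setting described in the abstract above (and not of the authors' large-deviation computation), the sketch below samples a GOE matrix plus a rank-one perturbation of strength $θ$ and estimates the largest eigenvalue and the squared overlap of its eigenvector with the perturbation direction. The normalization (semicircle supported on $[-2,2]$), the function names `spiked_goe` and `top_eigenpair`, the use of plain power iteration, and all parameter values are assumptions made for this example. With this normalization, the standard typical values for $θ>1$ are a largest eigenvalue close to $θ+1/θ$ and a squared overlap close to $1-1/θ^2$.

```python
import math
import random


def spiked_goe(n, theta, rng):
    """Return (M, v) with M = W + theta * v v^T.

    W is a GOE matrix normalised so that its spectrum fills [-2, 2]:
    off-diagonal variance 1/n, diagonal variance 2/n (assumed convention).
    """
    v = [1.0 / math.sqrt(n)] * n  # any fixed unit vector works by rotational invariance
    m = [[0.0] * n for _ in range(n)]
    for i in range(n):
        m[i][i] = rng.gauss(0.0, math.sqrt(2.0 / n)) + theta * v[i] * v[i]
        for j in range(i + 1, n):
            w = rng.gauss(0.0, math.sqrt(1.0 / n))
            m[i][j] = m[j][i] = w + theta * v[i] * v[j]
    return m, v


def top_eigenpair(m, rng, iters=80):
    """Estimate the eigenvalue of largest modulus by power iteration.

    For theta > 1 the outlier lies outside [-2, 2], so it dominates in modulus.
    """
    n = len(m)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    for _ in range(iters):
        y = [sum(a * b for a, b in zip(row, x)) for row in m]
        norm = math.sqrt(sum(c * c for c in y))
        x = [c / norm for c in y]
    y = [sum(a * b for a, b in zip(row, x)) for row in m]
    lam = sum(a * b for a, b in zip(x, y))  # Rayleigh quotient of the unit vector x
    return lam, x
```

A typical use is `m, v = spiked_goe(200, 3.0, random.Random(0))` followed by `lam, x = top_eigenpair(m, random.Random(1))`; the squared overlap is then `sum(a * b for a, b in zip(x, v)) ** 2`. Power iteration is chosen only to keep the example dependency-free; a dense eigensolver would be the usual choice for larger matrices.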
-
Perspective: Gardner Physics in Amorphous Solids and Beyond
Authors:
Ludovic Berthier,
Giulio Biroli,
Patrick Charbonneau,
Eric I. Corwin,
Silvio Franz,
Francesco Zamponi
Abstract:
One of the most remarkable predictions to emerge out of the exact infinite-dimensional solution of the glass problem is the Gardner transition. Although this transition was first theoretically proposed a generation ago for certain mean-field spin glass models, its materials relevance was only realized when a systematic effort to relate glass formation and jamming was undertaken. A number of nontrivial physical signatures associated to the Gardner transition have since been considered in various areas, from models of structural glasses to constraint satisfaction problems. This Perspective surveys these recent advances and discusses the novel research opportunities that arise from them.
Submitted 28 May, 2019; v1 submitted 27 February, 2019;
originally announced February 2019.
-
Numerical implementation of dynamical mean field theory for disordered systems: application to the Lotka-Volterra model of ecosystems
Authors:
Felix Roy,
Giulio Biroli,
Guy Bunin,
Chiara Cammarota
Abstract:
Dynamical mean field theory (DMFT) is a tool that allows one to analyze the stochastic dynamics of $N$ interacting degrees of freedom in terms of a self-consistent $1$-body problem. In this work, focusing on models of ecosystems, we present the derivation of DMFT through the dynamical cavity method, and we develop a method for solving it numerically. Our numerical procedure can be applied to a large variety of systems for which DMFT holds. We implement and test it for the generalized random Lotka-Volterra model, and show that complex dynamical regimes characterized by chaos and aging can be captured and studied by this framework.
Submitted 28 January, 2019;
originally announced January 2019.
-
Scaling description of generalization with number of parameters in deep learning
Authors:
Mario Geiger,
Arthur Jacot,
Stefano Spigler,
Franck Gabriel,
Levent Sagun,
Stéphane d'Ascoli,
Giulio Biroli,
Clément Hongler,
Matthieu Wyart
Abstract:
Supervised deep learning involves the training of neural networks with a large number $N$ of parameters. For large enough $N$, in the so-called over-parametrized regime, one can essentially fit the training data points. Sparsity-based arguments would suggest that the generalization error increases as $N$ grows past a certain threshold $N^{*}$. Instead, empirical studies have shown that in the over-parametrized regime, generalization error keeps decreasing with $N$. We resolve this paradox through a new framework. We rely on the so-called Neural Tangent Kernel, which connects large neural nets to kernel methods, to show that the initialization causes finite-size random fluctuations $\|f_{N}-\bar{f}_{N}\|\sim N^{-1/4}$ of the neural net output function $f_{N}$ around its expectation $\bar{f}_{N}$. These affect the generalization error $ε_{N}$ for classification: under natural assumptions, it decays to a plateau value $ε_{\infty}$ in a power-law fashion $\sim N^{-1/2}$. This description breaks down at a so-called jamming transition $N=N^{*}$. At this threshold, we argue that $\|f_{N}\|$ diverges. This result leads to a plausible explanation for the cusp in test error known to occur at $N^{*}$. Our results are confirmed by extensive empirical observations on the MNIST and CIFAR image datasets. Our analysis finally suggests that, given a computational envelope, the smallest generalization error is obtained using several networks of intermediate sizes, just beyond $N^{*}$, and averaging their outputs.
Submitted 8 October, 2019; v1 submitted 6 January, 2019;
originally announced January 2019.
-
Marvels and Pitfalls of the Langevin Algorithm in Noisy High-dimensional Inference
Authors:
Stefano Sarao Mannelli,
Giulio Biroli,
Chiara Cammarota,
Florent Krzakala,
Pierfrancesco Urbani,
Lenka Zdeborová
Abstract:
Gradient-descent-based algorithms and their stochastic versions have widespread applications in machine learning and statistical inference. In this work we perform an analytic study of the performances of one of them, the Langevin algorithm, in the context of noisy high-dimensional inference. We employ the Langevin algorithm to sample the posterior probability measure for the spiked matrix-tensor model. The typical behaviour of this algorithm is described by a system of integro-differential equations that we call the Langevin state evolution, whose solution is compared with the one of the state evolution of approximate message passing (AMP). Our results show that, remarkably, the algorithmic threshold of the Langevin algorithm is sub-optimal with respect to the one given by AMP. We conjecture this phenomenon to be due to the residual glassiness present in that region of parameters. Finally we show how a landscape-annealing protocol that uses the Langevin algorithm but violates the Bayes-optimality condition can approach the performance of AMP.
Submitted 13 January, 2020; v1 submitted 21 December, 2018;
originally announced December 2018.
-
A jamming transition from under- to over-parametrization affects loss landscape and generalization
Authors:
Stefano Spigler,
Mario Geiger,
Stéphane d'Ascoli,
Levent Sagun,
Giulio Biroli,
Matthieu Wyart
Abstract:
We argue that in fully-connected networks a phase transition delimits the over- and under-parametrized regimes where fitting can or cannot be achieved. Under some general conditions, we show that this transition is sharp for the hinge loss. In the whole over-parametrized regime, poor minima of the loss are not encountered during training since the number of constraints to satisfy is too small to hamper minimization. Our findings support a link between this transition and the generalization properties of the network: as we increase the number of parameters of a given model, starting from an under-parametrized network, we observe that the generalization error displays three phases: (i) initial decay, (ii) increase until the transition point --- where it displays a cusp --- and (iii) slow decay toward a constant for the rest of the over-parametrized regime. Thereby we identify the region where the classical phenomenon of over-fitting takes place, and the region where the model keeps improving, in line with previous empirical observations for modern neural networks.
Submitted 18 June, 2019; v1 submitted 22 October, 2018;
originally announced October 2018.
-
Dynamics around the Site Percolation Threshold on High-Dimensional Hypercubic Lattices
Authors:
Giulio Biroli,
Patrick Charbonneau,
Yi Hu
Abstract:
Recent advances on the glass problem motivate reexamining classical models of percolation. Here, we consider the displacement of an ant in a labyrinth near the percolation threshold on cubic lattices both below and above the upper critical dimension of simple percolation, $d_u=6$. Using theory and simulations, we examine the scaling regime and find that both caging and subdiffusion scale logarithmically for $d \geq d_u$. The theoretical derivation considers Bethe lattices with generalized connectivity and a random graph model, and employs a scaling analysis to confirm that logarithmic scalings should persist in the infinite dimension limit. The computational validation employs accelerated random walk simulations with a transfer-matrix description of diffusion to evaluate directly the dynamical critical exponents below $d_u$ as well as their logarithmic scaling above $d_u$. Our numerical results improve various earlier estimates and are fully consistent with our theoretical predictions.
Submitted 17 October, 2018;
originally announced October 2018.
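To make the "ant in a labyrinth" setting of the abstract above concrete, here is a minimal sketch of a blind-ant random walk on a site-diluted $d$-dimensional hypercubic lattice. It is a naive simulation, not the accelerated random-walk and transfer-matrix scheme the abstract mentions. The function name `ant_msd`, the lazy sampling of site occupations, the use of one fresh disorder realisation per walker, and the fact that the starting site is not conditioned on belonging to the percolating cluster are all simplifying assumptions of this example.

```python
import random


def ant_msd(d, p, steps, walkers, seed=0):
    """Mean-square displacement of a blind ant after `steps` attempted moves.

    Each site of the d-dimensional hypercubic lattice is occupied with
    probability p; occupations are sampled lazily the first time a site is probed.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(walkers):
        origin = (0,) * d
        occupied = {origin: True}  # the ant starts on an occupied site
        pos = origin
        for _ in range(steps):
            axis = rng.randrange(d)
            delta = rng.choice((-1, 1))
            trial = pos[:axis] + (pos[axis] + delta,) + pos[axis + 1:]
            if trial not in occupied:
                occupied[trial] = rng.random() < p
            if occupied[trial]:
                pos = trial  # blind ant: a move onto an empty site is rejected
        total += sum(c * c for c in pos)
    return total / walkers
```

For example, `ant_msd(3, 1.0, 100, 2000)` samples ordinary diffusion on the full lattice, while lowering `p` towards the percolation threshold slows the walk down. Reaching the scaling regime discussed in the abstract would require far longer walks and larger statistics than this sketch is designed for.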
-
Delocalization and ergodicity of the Anderson model on Bethe lattices
Authors:
Giulio Biroli,
Marco Tarzia
Abstract:
We review the state of the art on the delocalized non-ergodic regime of the Anderson model on Bethe lattices. We also present new results using Belief Propagation, which consists in solving the self-consistent recursion relations for the Green's functions directly on a given sample. This allows us to numerically study very large system sizes and to directly access observables related to the eigenfunctions and energy level statistics. In agreement with recent works, we establish the existence of a delocalized non-ergodic phase on Cayley trees. On random regular graphs, instead, our results indicate that ergodicity is recovered when the system size is larger than a cross-over scale $N_c (W)$, which diverges exponentially fast approaching the localization transition. This scale corresponds to the size at which the mean-level spacing becomes smaller than the Thouless energy $E_{Th} (W)$. This energy scale, which vanishes exponentially fast approaching the localization transition, is the one below which ergodicity in the level statistics is restored in the thermodynamic limit. Remarkably, the behavior of random regular graphs below $N_c (W)$ coincides with the one found close to the root of loop-less infinite Cayley trees, i.e., only above $N_c (W)$ do the effects of loops emerge and random regular graphs behave differently from Cayley trees. Our results indicate that ergodicity is recovered in the thermodynamic limit on random regular graphs. However, all observables probing volumes smaller than $N_c(W)$ and times smaller than $\hbar/E_{Th} (W)$ are expected to behave as if there were an intermediate phase. Given the very fast divergence of $N_c(W)$ and $\hbar/E_{Th} (W)$ these non-ergodic effects are very pronounced in a large region preceding the localization transition, and they can be related to the intermediate phase present on Cayley trees.
Submitted 17 October, 2018;
originally announced October 2018.
-
The jamming transition as a paradigm to understand the loss landscape of deep neural networks
Authors:
Mario Geiger,
Stefano Spigler,
Stéphane d'Ascoli,
Levent Sagun,
Marco Baity-Jesi,
Giulio Biroli,
Matthieu Wyart
Abstract:
Deep learning has been immensely successful at a variety of tasks, ranging from classification to AI. Learning corresponds to fitting training data, which is implemented by descending a very high-dimensional loss function. Understanding under which conditions neural networks do not get stuck in poor minima of the loss, and how the landscape of that loss evolves as depth is increased remains a challenge. Here we predict, and test empirically, an analogy between this landscape and the energy landscape of repulsive ellipses. We argue that in FC networks a phase transition delimits the over- and under-parametrized regimes where fitting can or cannot be achieved. In the vicinity of this transition, properties of the curvature of the minima of the loss are critical. This transition shares direct similarities with the jamming transition by which particles form a disordered solid as the density is increased, which also occurs in certain classes of computational optimization and learning problems such as the perceptron. Our analysis gives a simple explanation as to why poor minima of the loss cannot be encountered in the overparametrized regime, and puts forward the surprising result that the ability of fully connected networks to fit random data is independent of their depth. Our observations suggest that this independence also holds for real data. We also study a quantity $Δ$ which characterizes how well ($Δ<0$) or badly ($Δ>0$) a datum is learned. At the critical point it is power-law distributed, $P_+(Δ)\simΔ^θ$ for $Δ>0$ and $P_-(Δ)\sim(-Δ)^{-γ}$ for $Δ<0$, with $θ\approx0.3$ and $γ\approx0.2$. This observation suggests that near the transition the loss landscape has a hierarchical structure and that the learning dynamics is prone to avalanche-like dynamics, with abrupt changes in the set of patterns that are learned.
Submitted 17 June, 2019; v1 submitted 25 September, 2018;
originally announced September 2018.
-
Complexity of energy barriers in mean-field glassy systems
Authors:
Valentina Ros,
Giulio Biroli,
Chiara Cammarota
Abstract:
We analyze the energy barriers that allow escapes from a given local minimum in a mean-field model of glasses. We perform this study by using the Kac-Rice method and computing the typical number of critical points of the energy function at a given distance from the minimum. We analyze their Hessian in terms of random matrix theory and show that for a certain regime of energies and distances critical points are index-one saddles and are associated to barriers. We find that the lowest barrier, important for activated dynamics at low temperature, is strictly lower than the "threshold" level above which saddles proliferate. We characterize how the quenched complexity of barriers, important for activated process at finite temperature, depends on the energy of the barrier, the energy of the initial minimum, and the distance between them. The overall picture gained from this study is expected to hold generically for mean-field models of the glass transition.
Submitted 14 September, 2018;
originally announced September 2018.
-
Random Field Ising-like effective theory of the glass transition II: Finite Dimensional Models
Authors:
G. Biroli,
C. Cammarota,
G. Tarjus,
M. Tarzia
Abstract:
As in the preceding paper we aim at identifying the effective theory that describes the fluctuations of the local overlap with an equilibrium reference configuration close to a putative thermodynamic glass transition. We focus here on the case of finite-dimensional glass-forming systems, in particular supercooled liquids. The main difficulty for going beyond the mean-field treatment comes from the presence of diverging point-to-set spatial correlations. We introduce a variational low-temperature approximation scheme that allows us to account, at least in part, for the effect of these correlations. The outcome is an effective theory for the overlap fluctuations in terms of a random-field + random-bond Ising model with additional, power-law decaying, pair and multi-body interactions generated by the point-to-set correlations. This theory is much more tractable than the original problem. We check the robustness of the approximation scheme by applying it to a fully connected model already studied in the companion paper. We discuss the physical implications of this mapping for glass-forming liquids and the possibility it offers to determine the presence or not of a finite-temperature thermodynamic glass transition.
Submitted 17 July, 2018;
originally announced July 2018.
-
Random-Field Ising like effective theory of the glass transition: I Mean-Field Models
Authors:
G. Biroli,
C. Cammarota,
G. Tarjus,
M. Tarzia
Abstract:
In this paper and in the companion one we address the problem of identifying the effective theory that describes the statistics of the fluctuations of what is thought to be the relevant order parameter for glassy systems---the overlap field with an equilibrium reference configuration---close to the putative thermodynamic glass transition. Our starting point is the mean-field theory of glass formation which relies on the existence of a complex free-energy landscape with a multitude of metastable states. In this paper, we focus on archetypal mean-field models possessing this type of free-energy landscape and set up the framework to determine the exact effective theory. We show that the effective theory at the mean-field level is generically of the random-field + random-bond Ising type. We also discuss what are the main issues concerning the extension of our result to finite-dimensional systems. This extension is addressed in detail in the companion paper.
Submitted 17 July, 2018;
originally announced July 2018.
-
Can the glass transition be explained without a growing static length scale?
Authors:
Ludovic Berthier,
Giulio Biroli,
Jean-Philippe Bouchaud,
Gilles Tarjus
Abstract:
It was recently discovered that SWAP, a Monte Carlo algorithm that involves the exchange of pairs of particles of differing diameters, can dramatically accelerate the equilibration of simulated supercooled liquids in regimes where the normal dynamics is glassy. This spectacular effect was subsequently interpreted as direct evidence against a static, cooperative explanation of the glass transition such as the one offered by the random first-order transition (RFOT) theory. We review several empirical facts that support the opposite view, namely, that a local mechanism cannot explain the glass transition phenomenology. We explain the speedup induced by SWAP within the framework of the RFOT theory. We suggest that the efficiency of SWAP stems from a postponed onset of glassy dynamics, which allows the efficient exploration of configuration space even in the regime where the physical dynamics is dominated by activated events across free-energy barriers. We describe this effect in terms of `crumbling metastability' and use the example of nucleation to illustrate the possibility of circumventing free-energy barriers of thermodynamic origin by a change of the local dynamical rules.
Submitted 3 July, 2018; v1 submitted 31 May, 2018;
originally announced May 2018.
-
Activated dynamics: an intermediate model between REM and p-spin
Authors:
Marco Baity-Jesi,
Alexandre Achard-de Lustrac,
Giulio Biroli
Abstract:
In order to study the activated dynamics of mean-field glasses, which takes place on times of order exp(N), where N is the system size, we introduce a new model, the Correlated Random Energy Model (CREM), that allows for a smooth interpolation between the REM and the p-spin models. We study numerically and analytically the CREM in the intermediate regime between REM and p-spin. We fully characterize its energy landscape, which is like a golf-course but, at variance with the REM, has metabasins (or holes) containing several configurations. We find that an effective trap-like description for the dynamics emerges, provided that one identifies metabasins in the CREM with configurations in the trap model.
Submitted 11 May, 2018;
originally announced May 2018.
-
Complex energy landscapes in spiked-tensor and simple glassy models: ruggedness, arrangements of local minima and phase transitions
Authors:
Valentina Ros,
Gerard Ben Arous,
Giulio Biroli,
Chiara Cammarota
Abstract:
We study rough high-dimensional landscapes in which an increasingly strong preference for a given configuration emerges. Such energy landscapes arise in glass physics and inference. In particular we focus on random Gaussian functions, and on the spiked-tensor model and generalizations. We thoroughly analyze the statistical properties of the corresponding landscapes and characterize the associated geometrical phase transitions. In order to perform our study, we develop a framework based on the Kac-Rice method that allows one to compute the complexity of the landscape, i.e. the logarithm of the typical number of stationary points and their Hessian. This approach generalizes the one used to compute rigorously the annealed complexity of mean-field glass models. We discuss its advantages with respect to previous frameworks, in particular the thermodynamical replica method, which is shown to lead to partially incorrect predictions.
Submitted 24 April, 2018; v1 submitted 8 April, 2018;
originally announced April 2018.
-
A random critical point separates brittle and ductile yielding transitions in amorphous materials
Authors:
Misaki Ozawa,
Ludovic Berthier,
Giulio Biroli,
Alberto Rosso,
Gilles Tarjus
Abstract:
We combine an analytically solvable mean-field elasto-plastic model with molecular dynamics simulations of a generic glass-former to demonstrate that, depending on their preparation protocol, amorphous materials can yield in two qualitatively distinct ways. We show that well-annealed systems yield in a discontinuous brittle way, as metallic and molecular glasses do. Yielding corresponds in this case to a first-order nonequilibrium phase transition. As the degree of annealing decreases, the first-order character becomes weaker and the transition terminates in a second-order critical point in the universality class of an Ising model in a random field. For even more poorly annealed systems, yielding becomes a smooth crossover, representative of the ductile rheological behavior generically observed in foams, emulsions, and colloidal glasses. Our results show that the variety of yielding behavior found in amorphous materials does not result from the diversity of particle interactions or microscopic dynamics per se, but is instead unified by carefully considering the role of the initial stability of the system.
Submitted 11 May, 2018; v1 submitted 30 March, 2018;
originally announced March 2018.
-
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Authors:
M. Baity-Jesi,
L. Sagun,
M. Geiger,
S. Spigler,
G. Ben Arous,
C. Cammarota,
Y. LeCun,
M. Wyart,
G. Biroli
Abstract:
We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems. The two main issues we address are (1) the complexity of the loss landscape and of the dynamics within it, and (2) to what extent DNNs share similarities with glassy systems. Our findings, obtained for different architectures and datasets, suggest that during the training process the dynamics slows down because of an increasingly large number of flat directions. At large times, when the loss is approaching zero, the system diffuses at the bottom of the landscape. Despite some similarities with the dynamics of mean-field glassy systems, in particular, the absence of barrier crossing, we find distinctive dynamical behaviors in the two cases, showing that the statistical properties of the corresponding loss and energy landscapes are different. In contrast, when the network is under-parametrized we observe a typical glassy behavior, thus suggesting the existence of different phases depending on whether the network is under-parametrized or over-parametrized.
Submitted 7 June, 2018; v1 submitted 19 March, 2018;
originally announced March 2018.
-
Experimental Determination of Configurational Entropy in a Two-Dimensional Liquid under Random Pinning
Authors:
Ian Williams,
Francesco Turci,
James E. Hallett,
Peter Crowther,
Chiara Cammarota,
Giulio Biroli,
C. Patrick Royall
Abstract:
A quasi two-dimensional colloidal suspension is studied under the influence of immobilisation (pinning) of a random fraction of its particles. We introduce a novel experimental method to perform random pinning and, with the support of numerical simulation, we find that increasing the pinning concentration smoothly arrests the system, with a cross-over from a regime of high mobility and high entropy to a regime of low mobility and low entropy. At the local level, we study fluctuations in area fraction and concentration of pins and map them to entropic structural signatures and local mobility, obtaining a measure for the local entropic fluctuations of the experimental system.
Submitted 7 March, 2018;
originally announced March 2018.
-
Unifying different interpretations of the nonlinear response in glass-forming liquids
Authors:
P. Gadige,
S. Albert,
M. Mich,
Th. Bauer,
P. Lunkenheimer,
A. Loidl,
R. Tourbot,
C. Wiertel-Gasquet,
G. Biroli,
J. -P. Bouchaud,
F. Ladieu
Abstract:
This work aims at reconsidering several interpretations coexisting in the recent literature concerning non-linear susceptibilities in supercooled liquids. We present experimental results on glycerol and propylene carbonate showing that the three independent cubic susceptibilities have very similar frequency and temperature dependences, both for their amplitudes and phases. This strongly suggests a unique physical mechanism responsible for the growth of these non-linear susceptibilities. We show that the framework proposed by two of us [BB, Phys. Rev. B 72, 064204 (2005)], where the growth of non-linear susceptibilities is intimately related to the growth of "glassy domains", accounts for all the salient experimental features. We then review several complementary and/or alternative models, and show that the notion of cooperatively rearranging glassy domains is a key (implicit or explicit) ingredient to all of them. This paves the way for future experiments which should deepen our understanding of glasses.
Submitted 1 November, 2017;
originally announced November 2017.
-
Out-of-equilibrium dynamical mean-field equations for the perceptron model
Authors:
Elisabeth Agoritsas,
Giulio Biroli,
Pierfrancesco Urbani,
Francesco Zamponi
Abstract:
Perceptrons are the building blocks of many theoretical approaches to a wide range of complex systems, ranging from neural networks and deep learning machines, to constraint satisfaction problems, glasses and ecosystems. Despite their applicability and importance, a detailed study of their Langevin dynamics has not yet been performed. Here we derive the mean-field dynamical equations that describe the continuous random perceptron in the thermodynamic limit, in a very general setting with arbitrary noise and friction kernels, not necessarily related by equilibrium relations. We derive the equations in two ways: via a dynamical cavity method, and via a path-integral approach in its supersymmetric formulation. The end point of both approaches is the reduction of the dynamics of the system to an effective stochastic process for a representative dynamical variable. Because the perceptron is formally very close to a system of interacting particles in a high dimensional space, the methods we develop here can be transferred to the study of liquids and glasses in high dimensions. Potentially interesting applications are thus the study of the glass transition in active matter, the study of the dynamics around the jamming transition, and the calculation of rheological properties in driven systems.
Submitted 19 November, 2018; v1 submitted 13 October, 2017;
originally announced October 2017.
-
Marginally Stable Equilibria in Critical Ecosystems
Authors:
Giulio Biroli,
Guy Bunin,
Chiara Cammarota
Abstract:
In this work we study the stability of the equilibria reached by ecosystems formed by a large number of species. The model we focus on is the set of Lotka-Volterra equations with symmetric random interactions. Our theoretical analysis, confirmed by our numerical studies, shows that for strong and heterogeneous interactions the system displays multiple equilibria which are all marginally stable. This property allows us to obtain general identities between diversity and single species responses, which generalize and saturate May's bound. By connecting the model to systems studied in condensed matter physics, we show that the multiple equilibria regime is analogous to a critical spin-glass phase. This relation provides a new perspective as to why many systems in several different fields appear to be poised at the edge of stability and also suggests new experimental ways to probe marginal stability.
Submitted 10 October, 2017;
originally announced October 2017.
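As a rough illustration of the kind of model named in the abstract above, the following sketch integrates Lotka-Volterra equations with a symmetric random interaction matrix. The specific form dx_i/dt = x_i (1 - x_i - sum_j alpha_ij x_j), the scaling of the mean and standard deviation of alpha_ij with the number of species, the parameter values, and the name `simulate_lv` are all assumptions chosen for illustration; they are not taken from the paper, and the default parameters are arbitrary rather than tuned to any particular regime.

```python
import math
import random


def simulate_lv(n_species=15, mu=4.0, sigma=0.3, dt=0.01, n_steps=3000, seed=0):
    """Integrate assumed Lotka-Volterra dynamics with symmetric random couplings."""
    rng = random.Random(seed)
    s = n_species
    # Symmetric interaction matrix: alpha[i][j] == alpha[j][i], zero diagonal.
    alpha = [[0.0] * s for _ in range(s)]
    for i in range(s):
        for j in range(i + 1, s):
            a = rng.gauss(mu / s, sigma / math.sqrt(s))
            alpha[i][j] = a
            alpha[j][i] = a
    x = [rng.uniform(0.1, 1.0) for _ in range(s)]  # initial abundances
    for _ in range(n_steps):
        growth = [
            1.0 - x[i] - sum(alpha[i][j] * x[j] for j in range(s))
            for i in range(s)
        ]
        # Multiplicative (exponential) Euler step keeps abundances positive.
        x = [x[i] * math.exp(dt * growth[i]) for i in range(s)]
    return alpha, x
```

Calling `alpha, x = simulate_lv()` returns the coupling matrix and the final abundances; repeating the run with different initial conditions at fixed `alpha` is one simple way to look for multiple equilibria.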
-
Activated Aging Dynamics and Effective Trap Model Description in the Random Energy Model
Authors:
Marco Baity-Jesi,
Giulio Biroli,
Chiara Cammarota
Abstract:
We study the out-of-equilibrium aging dynamics of the Random Energy Model (REM) ruled by a single spin-flip Metropolis dynamics. We focus on the dynamical evolution taking place on time-scales diverging with the system size. Our aim is to show to what extent the activated dynamics displayed by the REM can be described in terms of an effective trap model. We identify two time regimes: the first one corresponds to the process of escaping from a basin in the energy landscape and to the subsequent exploration of high energy configurations, whereas the second one corresponds to the evolution from one deep basin to another. By combining numerical simulations with analytical arguments we show why the trap model description does not hold in the first regime but becomes exact in the second.
Submitted 29 November, 2017; v1 submitted 10 August, 2017;
originally announced August 2017.
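The dynamics named in the abstract above, single spin-flip Metropolis moves in the REM, can be sketched in a few lines. This is a minimal illustration under stated assumptions: configurations are encoded as integers whose bits are the spins, energies are independent Gaussians with variance N/2 (one common convention, assumed here), and the function name and parameters are hypothetical.

```python
import math
import random


def rem_metropolis(n_spins=10, beta=2.0, n_steps=5000, seed=0):
    """Single spin-flip Metropolis dynamics on a Random Energy Model sample."""
    rng = random.Random(seed)
    # One independent Gaussian energy per configuration (assumed variance N/2).
    std = math.sqrt(n_spins / 2.0)
    energies = [rng.gauss(0.0, std) for _ in range(2 ** n_spins)]
    state = rng.randrange(2 ** n_spins)  # random initial configuration
    trajectory = [energies[state]]
    for _ in range(n_steps):
        # Flip one randomly chosen spin, i.e. toggle one bit of the state.
        flipped = state ^ (1 << rng.randrange(n_spins))
        delta = energies[flipped] - energies[state]
        if delta <= 0.0 or rng.random() < math.exp(-beta * delta):
            state = flipped
        trajectory.append(energies[state])
    return energies, trajectory
```

Because neighbouring configurations have uncorrelated energies, the trajectory at large `beta` tends to sit for long stretches in low-energy configurations, which is the kind of behaviour a trap-model description aims to capture.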
-
Delocalized Glassy Dynamics and Many Body Localization
Authors:
Giulio Biroli,
Marco Tarzia
Abstract:
We analyze the unusual slow dynamics that emerges in the bad metal delocalized phase preceding the Many-Body Localization transition by using single-particle Anderson Localization on the Bethe lattice as a toy model of many-body dynamics in Fock space. We probe the dynamical evolution by measuring observables such as the imbalance and equilibrium correlation functions, which display slow dynamics and power-laws strikingly similar to the ones observed in recent simulations and experiments. We relate this unusual behavior to the non-ergodic spectral statistics found on Bethe lattices. We discuss different scenarios, such as a true intermediate phase which persists in the thermodynamic limit versus a glassy regime established on finite but very large time and length-scales only, and their implications for real space dynamical properties. In the latter, slow dynamics and power-laws extend over a very large time-window but are eventually cut off at a time-scale that diverges at the MBL transition.
Submitted 8 June, 2017;
originally announced June 2017.
-
(Non) equilibrium dynamics: a (broken) symmetry of the Keldysh generating functional
Authors:
Camille Aron,
Giulio Biroli,
Leticia F. Cugliandolo
Abstract:
We unveil the universal (model-independent) symmetry satisfied by Schwinger-Keldysh quantum field theories whenever they describe equilibrium dynamics. This is made possible by a generalization of the Schwinger-Keldysh path-integral formalism in which the physical time can be re-parametrized to arbitrary contours in the complex plane. Strong relations between correlation functions, such as the fluctuation-dissipation theorems, are derived as immediate consequences of this symmetry of equilibrium. In this view, quantum non-equilibrium dynamics -- e.g. when driving with a time-dependent potential -- are seen as symmetry-breaking processes. The symmetry-breaking terms of the action are identified as a measure of irreversibility, or entropy creation, defined at the level of a single quantum trajectory. Moreover, they are shown to obey quantum fluctuation theorems. These results extend stochastic thermodynamics to the quantum realm.
Submitted 20 December, 2017; v1 submitted 30 May, 2017;
originally announced May 2017.
-
Liu-Nagel phase diagrams in infinite dimension
Authors:
Giulio Biroli,
Pierfrancesco Urbani
Abstract:
We study Harmonic Soft Spheres as a model of thermal structural glasses in the limit of infinite dimensions. We show that cooling, compressing and shearing a glass lead to a Gardner transition and, hence, to a marginally stable amorphous solid as found for Hard Spheres systems. A general outcome of our results is that a reduced stability of the glass favors the appearance of the Gardner transition. Therefore using strong perturbations, e.g. shear and compression, on standard glasses or using weak perturbations on weakly stable glasses, e.g. the ones prepared close to the jamming point, are the generic ways to induce a Gardner transition. The formalism that we discuss allows one to study general perturbations, including strain deformations that are important to study soft glassy rheology at the mean field level.
Submitted 14 January, 2018; v1 submitted 15 April, 2017;
originally announced April 2017.
-
Real Space Migdal-Kadanoff Renormalisation of Glassy Systems: Recent Results and a Critical Assessment
Authors:
Maria Chiara Angelini,
Giulio Biroli
Abstract:
In this manuscript, in honour of L. Kadanoff, we present recent progress obtained in the description of finite dimensional glassy systems thanks to the Migdal-Kadanoff renormalisation group (MK-RG). We provide a critical assessment of the method, in particular we discuss its limitations in describing situations in which an infinite number of pure states might be present, and analyse the MK-RG flow in the limit of infinite dimensions. MK-RG predicts that the spin-glass transition in a field and the glass transition are governed by zero-temperature fixed points of the renormalization group flow. This implies a typical energy scale that grows, approaching the transition, as a power of the correlation length, thus leading to enormously large time-scales as expected from experiments and simulations. These fixed points exist only in dimensions larger than $d_L$, with $d_L>3$, but they nevertheless influence the RG flow below it, in particular in three dimensions. MK-RG thus predicts a similar behavior for spin-glasses in a field and models of glasses and relates it to the presence of avoided critical points.
Submitted 10 February, 2017;
originally announced February 2017.
-
Critical properties of the Anderson localization transition and the high dimensional limit
Authors:
Elena Tarquini,
Giulio Biroli,
Marco Tarzia
Abstract:
In this paper we present a thorough study of transport, spectral and wave-function properties at the Anderson localization critical point in spatial dimensions $d = 3$, $4$, $5$, $6$. Our aim is to analyze the dimensional dependence and to assess the role of the $d\rightarrow \infty$ limit provided by Bethe lattices and tree-like structures. Our results strongly suggest that the upper critical dimension of Anderson localization is infinite. Furthermore, we find that $d_U=\infty$ is a much better starting point than $d_L=2$ to describe even three dimensional systems. We find that, with increasing $d$, critical properties and finite size scaling behavior approach the ones found for Bethe lattices: the critical state becomes an insulator characterized by Poisson statistics and corrections to the thermodynamic limit become logarithmic in $N$. In the conclusion, we present physical consequences of our results, propose connections with the non-ergodic delocalised phase suggested for the Anderson model on infinite dimensional lattices and discuss perspectives for future research studies.
Submitted 14 December, 2016;
originally announced December 2016.