Skip to main content

Showing 1–50 of 172 results for author: Zdeborova, L

Searching in archive cond-mat. Search in all archives.
.
  1. arXiv:2506.15400  [pdf, ps, other

    cond-mat.dis-nn cs.IT math.PR

    The maximum-average subtensor problem: equilibrium and out-of-equilibrium properties

    Authors: Vittorio Erba, Nathan Malo Kupferschmid, Rodrigo Pérez Ortiz, Lenka Zdeborová

    Abstract: In this paper we introduce and study the Maximum-Average Subtensor ($p$-MAS) problem, in which one wants to find a subtensor of size $k$ of a given random tensor of size $N$, both of order $p$, with maximum sum of entries. We are motivated by recent work on the matrix case of the problem in which several equilibrium and non-equilibrium properties have been characterized analytically in the asympto… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  2. arXiv:2506.12454  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.CR cs.LG

    On the existence of consistent adversarial attacks in high-dimensional linear classification

    Authors: Matteo Vilucchio, Lenka Zdeborová, Bruno Loureiro

    Abstract: What fundamentally distinguishes an adversarial attack from a misclassification due to limited model expressivity or finite data? In this work, we investigate this question in the setting of high-dimensional binary classification, where statistical effects due to limited data availability play a central role. We introduce a new error metric that precisely capture this distinction, quantifying mode… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  3. arXiv:2506.09877  [pdf, ps, other

    cond-mat.dis-nn math-ph math.PR

    Sequential Dynamics in Ising Spin Glasses

    Authors: Yatin Dandi, David Gamarnik, Francisco Pernice, Lenka Zdeborová

    Abstract: We present the first exact asymptotic characterization of sequential dynamics for a broad class of local update algorithms on the Sherrington-Kirkpatrick (SK) model with Ising spins. Focusing on dynamics implemented via systematic scan -- encompassing Glauber updates at any temperature -- we analyze the regime where the number of spin updates scales linearly with system size. Our main result provi… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 55 pages, 6 figures

  4. arXiv:2506.02664  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Computational Thresholds in Multi-Modal Learning via the Spiked Matrix-Tensor Model

    Authors: Hugo Tabanelli, Pierre Mergny, Lenka Zdeborova, Florent Krzakala

    Abstract: We study the recovery of multiple high-dimensional signals from two noisy, correlated modalities: a spiked matrix and a spiked tensor sharing a common low-rank structure. This setting generalizes classical spiked matrix and tensor models, unveiling intricate interactions between inference channels and surprising algorithmic behaviors. Notably, while the spiked tensor model is typically intractable… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  5. arXiv:2506.02651  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.LG

    Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks

    Authors: Luca Arnaboldi, Bruno Loureiro, Ludovic Stephan, Florent Krzakala, Lenka Zdeborova

    Abstract: We study the dynamics of stochastic gradient descent (SGD) for a class of sequence models termed Sequence Single-Index (SSI) models, where the target depends on a single direction in input space applied to a sequence of tokens. This setting generalizes classical single-index models to the sequential domain, encompassing simplified one-layer attention architectures. We derive a closed-form expressi… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  6. arXiv:2506.01582  [pdf, ps, other

    cs.LG cond-mat.dis-nn cs.IT stat.ML

    Bayes optimal learning of attention-indexed models

    Authors: Fabrizio Boncoraglio, Emanuele Troiani, Vittorio Erba, Lenka Zdeborová

    Abstract: We introduce the attention-indexed model (AIM), a theoretical framework for analyzing learning in deep attention layers. Inspired by multi-index models, AIM captures how token-level outputs emerge from layered bilinear interactions over high-dimensional embeddings. Unlike prior tractable attention models, AIM allows full-width key and query matrices, aligning more closely with practical transforme… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  7. arXiv:2505.18046  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions

    Authors: Yizhou Xu, Florent Krzakala, Lenka Zdeborová

    Abstract: The Restricted Boltzmann Machine (RBM) is one of the simplest generative neural networks capable of learning input distributions. Despite its simplicity, the analysis of its performance in learning from the training data is only well understood in cases that essentially reduce to singular value decomposition of the data. Here, we consider the limit of a large dimension of the input space and a con… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  8. arXiv:2505.17958  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.IT cs.LG

    The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks

    Authors: Vittorio Erba, Emanuele Troiani, Lenka Zdeborová, Florent Krzakala

    Abstract: We study the high-dimensional asymptotics of empirical risk minimization (ERM) in over-parametrized two-layer neural networks with quadratic activations trained on synthetic data. We derive sharp asymptotics for both training and test errors by mapping the $\ell_2$-regularized learning problem to a convex matrix sensing task with nuclear norm penalization. This reveals that capacity control in suc… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  9. arXiv:2503.14121  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.IT cs.LG math.PR

    Fundamental Limits of Matrix Sensing: Exact Asymptotics, Universality, and Applications

    Authors: Yizhou Xu, Antoine Maillard, Lenka Zdeborová, Florent Krzakala

    Abstract: In the matrix sensing problem, one wishes to reconstruct a matrix from (possibly noisy) observations of its linear projections along given directions. We consider this model in the high-dimensional limit: while previous works on this model primarily focused on the recovery of low-rank matrices, we consider in this work more general classes of structured signal matrices with potentially large rank,… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  10. arXiv:2503.01361  [pdf, other

    cond-mat.dis-nn cs.LG

    Statistical physics analysis of graph neural networks: Approaching optimality in the contextual stochastic block model

    Authors: O. Duranthon, L. Zdeborová

    Abstract: Graph neural networks (GNNs) are designed to process data associated with graphs. They are finding an increasing range of applications; however, as with other modern machine learning techniques, their theoretical understanding is limited. GNNs can encounter difficulties in gathering information from nodes that are far apart by iterated aggregation steps. This situation is partly caused by so-calle… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  11. arXiv:2502.00901  [pdf, other

    cs.LG cond-mat.dis-nn

    Fundamental limits of learning in sequence multi-index models and deep attention networks: High-dimensional asymptotics and sharp thresholds

    Authors: Emanuele Troiani, Hugo Cui, Yatin Dandi, Florent Krzakala, Lenka Zdeborová

    Abstract: In this manuscript, we study the learning of deep attention neural networks, defined as the composition of multiple self-attention layers, with tied and low-rank weights. We first establish a mapping of such models to sequence multi-index models, a generalization of the widely studied multi-index model to sequential covariates, for which we establish a number of general results. In the context of… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  12. arXiv:2412.14794  [pdf, other

    cond-mat.dis-nn

    Dynamical Cavity Method for Hypergraphs and its Application to Quenches in the k-XOR-SAT Problem

    Authors: Aude Maier, Freya Behrens, Lenka Zdeborová

    Abstract: The dynamical cavity method and its backtracking version provide a powerful approach to studying the properties of dynamical processes on large random graphs. This paper extends these methods to hypergraphs, enabling the analysis of interactions involving more than two variables. We apply them to analyse the $k$-XOR-satisfiability ($k$-XOR-SAT) problem, an important model in theoretical computer s… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 26 pages

  13. arXiv:2410.18858  [pdf, other

    cond-mat.dis-nn cs.LG

    Bilinear Sequence Regression: A Model for Learning from Long Sequences of High-dimensional Tokens

    Authors: Vittorio Erba, Emanuele Troiani, Luca Biggio, Antoine Maillard, Lenka Zdeborová

    Abstract: Current progress in artificial intelligence is centered around so-called large language models that consist of neural networks processing long sequences of high-dimensional vectors called tokens. Statistical physics provides powerful tools to study the functioning of learning with neural networks and has played a recognized role in the development of modern machine learning. The statistical physic… ▽ More

    Submitted 21 May, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  14. arXiv:2410.16493  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Building Conformal Prediction Intervals with Approximate Message Passing

    Authors: Lucas Clarté, Lenka Zdeborová

    Abstract: Conformal prediction has emerged as a powerful tool for building prediction intervals that are valid in a distribution-free way. However, its evaluation may be computationally costly, especially in the high-dimensional setting where the dimensionality and sample sizes are both large and of comparable magnitudes. To address this challenge in the context of generalized linear regression, we propose… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  15. arXiv:2408.08319  [pdf, other

    cs.IT cond-mat.dis-nn

    The phase diagram of compressed sensing with $\ell_0$-norm regularization

    Authors: Damien Barbier, Carlo Lucibello, Luca Saglietti, Florent Krzakala, Lenka Zdeborová

    Abstract: Noiseless compressive sensing is a two-steps setting that allows for undersampling a sparse signal and then reconstructing it without loss of information. The LASSO algorithm, based on $\lone$ regularization, provides an efficient and robust to address this problem, but it fails in the regime of very high compression rate. Here we present two algorithms based on $\lzero$-norm regularization instea… ▽ More

    Submitted 22 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2304.12127

  16. arXiv:2408.03733  [pdf, other

    stat.ML cond-mat.dis-nn cs.IT cs.LG math.PR

    Bayes-optimal learning of an extensive-width neural network from quadratically many samples

    Authors: Antoine Maillard, Emanuele Troiani, Simon Martin, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the problem of learning a target function corresponding to a single hidden layer neural network, with a quadratic activation function after the first layer, and random weights. We consider the asymptotic limit where the input dimension and the network width are proportionally large. Recent work [Cui & al '23] established that linear regression provides Bayes-optimal test error to learn… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 47 pages

    Journal ref: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

  17. arXiv:2407.03522  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Optimal thresholds and algorithms for a model of multi-modal learning in high dimensions

    Authors: Christian Keup, Lenka Zdeborová

    Abstract: This work explores multi-modal inference in a high-dimensional simplified model, analytically quantifying the performance gain of multi-modal inference over that of analyzing modalities in isolation. We present the Bayes-optimal performance and weak recovery thresholds in a model where the objective is to recover the latent structures from two noisy data matrices with correlated spikes. The paper… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  18. arXiv:2406.01710  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech

    Counting and Hardness-of-Finding Fixed Points in Cellular Automata on Random Graphs

    Authors: Cédric Koller, Freya Behrens, Lenka Zdeborová

    Abstract: We study the fixed points of outer-totalistic cellular automata on sparse random regular graphs. These can be seen as constraint satisfaction problems, where each variable must adhere to the same local constraint, which depends solely on its state and the total number of its neighbors in each possible state. Examples of this setting include classical problems such as independent sets or assortativ… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 26 pages, 7 figures

    Journal ref: J. Phys. A: Math. Theor. 57 (2024) 465001

  19. arXiv:2405.15480  [pdf, other

    cs.LG cond-mat.dis-nn cs.CC

    Fundamental computational limits of weak learnability in high-dimensional multi-index models

    Authors: Emanuele Troiani, Yatin Dandi, Leonardo Defilippis, Lenka Zdeborová, Bruno Loureiro, Florent Krzakala

    Abstract: Multi-index models - functions which only depend on the covariates through a non-linear transformation of their projection on a subspace - are a useful benchmark for investigating feature learning with neural nets. This paper examines the theoretical boundaries of efficient learnability in this hypothesis class, focusing on the minimum sample complexity required for weakly recovering their low-dim… ▽ More

    Submitted 2 April, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

  20. arXiv:2405.10763  [pdf, other

    cond-mat.dis-nn cs.DM math.OC stat.CO

    Integer Traffic Assignment Problem: Algorithms and Insights on Random Graphs

    Authors: Rayan Harfouche, Giovanni Piccioli, Lenka Zdeborová

    Abstract: Path optimization is a fundamental concern across various real-world scenarios, ranging from traffic congestion issues to efficient data routing over the internet. The Traffic Assignment Problem (TAP) is a classic continuous optimization problem in this field. This study considers the Integer Traffic Assignment Problem (ITAP), a discrete variant of TAP. ITAP involves determining optimal routes for… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 37 pages, 15 figures

    Journal ref: Physical Review E 111.1 (2025): 014316

  21. Quenches in the Sherrington-Kirkpatrick model

    Authors: Vittorio Erba, Freya Behrens, Florent Krzakala, Lenka Zdeborová

    Abstract: The Sherrington-Kirkpatrick (SK) model is a prototype of a complex non-convex energy landscape. Dynamical processes evolving on such landscapes and locally aiming to reach minima are generally poorly understood. Here, we study quenches, i.e. dynamics that locally aim to decrease energy. We analyse the energy at convergence for two distinct algorithmic classes, single-spin flip and synchronous dyna… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Journal ref: J. Stat. Mech. (2024) 083302

  22. arXiv:2402.13622  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.LG

    Analysis of Bootstrap and Subsampling in High-dimensional Regularized Regression

    Authors: Lucas Clarté, Adrien Vandenbroucque, Guillaume Dalle, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

    Abstract: We investigate popular resampling methods for estimating the uncertainty of statistical models, such as subsampling, bootstrap and the jackknife, and their performance in high-dimensional supervised regression tasks. We provide a tight asymptotic description of the biases and variances estimated by these methods in the context of generalized linear models, such as ridge and logistic regression, ta… ▽ More

    Submitted 1 November, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence, PMLR 244:787-819, 2024

  23. arXiv:2402.04980  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Asymptotics of feature learning in two-layer networks after one gradient-step

    Authors: Hugo Cui, Luca Pesce, Yatin Dandi, Florent Krzakala, Yue M. Lu, Lenka Zdeborová, Bruno Loureiro

    Abstract: In this manuscript, we investigate the problem of how two-layer neural networks learn features from data, and improve over the kernel regime, after being trained with a single gradient descent step. Leveraging the insight from (Ba et al., 2022), we model the trained network by a spiked Random Features (sRF) model. Further building on recent progress on Gaussian universality (Dandi et al., 2023), w… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:9662-9695, 2024

  24. arXiv:2402.03818  [pdf, other

    cs.LG cond-mat.dis-nn

    Asymptotic generalization error of a single-layer graph convolutional network

    Authors: O. Duranthon, L. Zdeborová

    Abstract: While graph convolutional networks show great practical promises, the theoretical understanding of their generalization properties as a function of the number of samples is still in its infancy compared to the more broadly studied case of supervised fully connected neural networks. In this article, we predict the performances of a single-layer graph convolutional network (GCN) trained on data prod… ▽ More

    Submitted 21 November, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the Third Learning on Graphs Conference (LoG 2024), PMLR 269

  25. Dynamical Phase Transitions in Graph Cellular Automata

    Authors: Freya Behrens, Barbora Hudcová, Lenka Zdeborová

    Abstract: Discrete dynamical systems can exhibit complex behaviour from the iterative application of straightforward local rules. A famous example are cellular automata whose global dynamics are notoriously challenging to analyze. To address this, we relax the regular connectivity grid of cellular automata to a random graph, which gives the class of graph cellular automata. Using the dynamical cavity method… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 15 pages

    Journal ref: Physical Review E 109.4 (2024): 044312

  26. arXiv:2310.02850  [pdf, other

    math.PR cond-mat.dis-nn

    On the Atypical Solutions of the Symmetric Binary Perceptron

    Authors: Damien Barbier, Ahmed El Alaoui, Florent Krzakala, Lenka Zdeborová

    Abstract: We study the random binary symmetric perceptron problem, focusing on the behavior of rare high-margin solutions. While most solutions are isolated, we demonstrate that these rare solutions are part of clusters of extensive entropy, heuristically corresponding to non-trivial fixed points of an approximate message-passing algorithm. We enumerate these clusters via a local entropy, defined as a Franz… ▽ More

    Submitted 28 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 26 pages, 6 figures

    Journal ref: Journal of Physics A: Mathematical and Theoretical 57.19 (2024): 195202

  27. arXiv:2308.14085  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective

    Authors: Davide Ghio, Yatin Dandi, Florent Krzakala, Lenka Zdeborová

    Abstract: Recent years witnessed the development of powerful generative models based on flows, diffusion or autoregressive neural networks, achieving remarkable success in generating data from examples with applications in a broad range of areas. A theoretical analysis of the performance and understanding of the limitations of these methods remain, however, challenging. In this paper, we undertake a step in… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: 39 pages, 12 figures

    Journal ref: Proceedings of the National Academy of Sciences 121.27 (2024): e2311810121

  28. arXiv:2305.11041  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    High-dimensional Asymptotics of Denoising Autoencoders

    Authors: Hugo Cui, Lenka Zdeborová

    Abstract: We address the problem of denoising data from a Gaussian mixture using a two-layer non-linear autoencoder with tied weights and a skip connection. We consider the high-dimensional limit where the number of training samples and the input dimension jointly tend to infinity while the number of hidden units remains bounded. We provide closed-form expressions for the denoising mean-squared test error.… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36 (2023)

  29. arXiv:2304.12127  [pdf, other

    cs.IT cond-mat.dis-nn cond-mat.stat-mech

    Compressed sensing with l0-norm: statistical physics analysis and algorithms for signal recovery

    Authors: D. Barbier, C Lucibello, L. Saglietti, F. Krzakala, L. Zdeborova

    Abstract: Noiseless compressive sensing is a protocol that enables undersampling and later recovery of a signal without loss of information. This compression is possible because the signal is usually sufficiently sparse in a given basis. Currently, the algorithm offering the best tradeoff between compression rate, robustness, and speed for compressive sensing is the LASSO (l1-norm bias) algorithm. However,… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Journal ref: Proceedings of IEEE Information Theory Workshop (ITW), pp. 323-328. IEEE, 2023

  30. arXiv:2303.17704  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech

    Bayes-optimal inference for spreading processes on random networks

    Authors: D. Ghio, A. L. M. Aragon, I. Biazzo, L. Zdeborova

    Abstract: We consider a class of spreading processes on networks, which generalize commonly used epidemic models such as the SIR model or the SIS model with a bounded number of re-infections. We analyse the related problem of inference of the dynamics based on its partial observations. We analyse these inference problems on random networks via a message-passing inference algorithm derived from the Belief Pr… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 25 pages, 12 figures

    MSC Class: 82D30 ACM Class: G.3; G.4; I.2

    Journal ref: Physical Review E 108.4 (2023): 044308

  31. Backtracking Dynamical Cavity Method

    Authors: Freya Behrens, Barbora Hudcová, Lenka Zdeborová

    Abstract: The cavity method is one of the cornerstones of the statistical physics of disordered systems such as spin glasses and other complex systems. It is able to analytically and asymptotically exactly describe the equilibrium properties of a broad range of models. Exact solutions for dynamical, out-of-equilibrium properties of disordered systems are traditionally much harder to obtain. Even very basic… ▽ More

    Submitted 8 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 14 pages

    Journal ref: Phys. Rev. X 13 (2023) 31021-31035

  32. arXiv:2303.09995  [pdf, other

    cond-mat.dis-nn cs.SI stat.ML

    Neural-prior stochastic block model

    Authors: O. Duranthon, L. Zdeborová

    Abstract: The stochastic block model (SBM) is widely studied as a benchmark for graph clustering aka community detection. In practice, graph data often come with node attributes that bear additional information about the communities. Previous works modeled such data by considering that the node attributes are generated from the node community memberships. In this work, motivated by a recent surge of works i… ▽ More

    Submitted 6 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Journal ref: Mach. Learn.: Sci. Technol. 4 035017 (2023)

  33. arXiv:2303.05237  [pdf, other

    cond-mat.dis-nn cs.IT

    Statistical mechanics of the maximum-average submatrix problem

    Authors: Vittorio Erba, Florent Krzakala, Rodrigo Pérez, Lenka Zdeborová

    Abstract: We study the maximum-average submatrix problem, in which given an $N \times N$ matrix $J$ one needs to find the $k \times k$ submatrix with the largest average of entries. We study the problem for random matrices $J$ whose entries are i.i.d. random variables by mapping it to a variant of the Sherrington-Kirkpatrick spin-glass model at fixed magnetization. We characterize analytically the phase dia… ▽ More

    Submitted 21 September, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Journal ref: J. Stat. Mech. (2024) 013403

  34. arXiv:2302.00375  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Bayes-optimal Learning of Deep Random Networks of Extensive-width

    Authors: Hugo Cui, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the problem of learning a target function corresponding to a deep, extensive-width, non-linear neural network with random Gaussian weights. We consider the asymptotic limit where the number of samples, the input dimension and the network width are proportionally large. We propose a closed-form expression for the Bayes-optimal test error, for regression and classification tasks. We furt… ▽ More

    Submitted 21 June, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:6468-6521, 2023

  35. arXiv:2210.08312  [pdf, other

    cond-mat.dis-nn cs.CC math.PR math.ST

    Disordered Systems Insights on Computational Hardness

    Authors: David Gamarnik, Cristopher Moore, Lenka Zdeborová

    Abstract: In this review article, we discuss connections between the physics of disordered systems, phase transitions in inference problems, and computational hardness. We introduce two models representing the behavior of glassy systems, the spiked tensor model and the generalized linear model. We discuss the random (non-planted) versions of these problems as prototypical optimization problems, as well as t… ▽ More

    Submitted 18 October, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: 42 pages

    Journal ref: J. Stat. Mech. (2022) 114015

  36. arXiv:2209.03423  [pdf, other

    cond-mat.dis-nn cs.DM cs.IT math.PR math.ST

    Planted matching problems on random hypergraphs

    Authors: Urte Adomaityte, Anshul Toshniwal, Gabriele Sicuro, Lenka Zdeborová

    Abstract: We consider the problem of inferring a matching hidden in a weighted random $k$-hypergraph. We assume that the hyperedges' weights are random and distributed according to two different densities conditioning on the fact that they belong to the hidden matching, or not. We show that, for $k>2$ and in the large graph size limit, an algorithmic first order transition in the signal strength separates a… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 13 pages, 12 figures

    Journal ref: Phys. Rev. E 106, 054302 (2022)

  37. arXiv:2208.06488  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IR math.PR stat.CO

    The planted XY model: thermodynamics and inference

    Authors: Siyu Chen, Guanhao Huang, Giovanni Piccioli, Lenka Zdeborová

    Abstract: In this paper we study a fully connected planted spin glass named the planted XY model. Motivation for studying this system comes both from the spin glass field and the one of statistical inference where it models the angular synchronization problem. We derive the replica symmetric (RS) phase diagram in the temperature, ferromagnetic bias plane using the approximate message passing (AMP) algorithm… ▽ More

    Submitted 11 January, 2024; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: 29 pages, 8 figures

    Journal ref: Phys. Rev. E 106, 054115 (2022)

  38. arXiv:2208.05918  [pdf, other

    math.PR cond-mat.dis-nn math.ST

    Low-rank Matrix Estimation with Inhomogeneous Noise

    Authors: Alice Guionnet, Justin Ko, Florent Krzakala, Lenka Zdeborová

    Abstract: We study low-rank matrix estimation for a generic inhomogeneous output channel through which the matrix is observed. This generalizes the commonly considered spiked matrix model with homogeneous noise to include for instance the dense degree-corrected stochastic block model. We adapt techniques used to study multispecies spin glasses to derive and rigorously prove an expression for the free energy… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 6 figures

    Journal ref: Information and Inference: A Journal of the IMA 14, no. 2 (2025): iaaf010

  39. arXiv:2205.13527  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.PR math.ST

    Subspace clustering in high-dimensions: Phase transitions & Statistical-to-Computational gap

    Authors: Luca Pesce, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

    Abstract: A simple model to study subspace clustering is the high-dimensional $k$-Gaussian mixture model where the cluster means are sparse vectors. Here we provide an exact asymptotic characterization of the statistically optimal reconstruction error in this model in the high-dimensional regime with extensive sparsity, i.e. when the fraction of non-zero components of the cluster means $ρ$, as well as the r… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: NeurIPS camera-ready version

    Journal ref: Advances in Neural Information Processing Systems (2022), vol 35, pages 27087--27099

  40. arXiv:2205.13303  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.PR math.ST

    Gaussian Universality of Perceptrons with Random Labels

    Authors: Federica Gerace, Florent Krzakala, Bruno Loureiro, Ludovic Stephan, Lenka Zdeborová

    Abstract: While classical in many theoretical settings - and in particular in statistical physics-inspired works - the assumption of Gaussian i.i.d. input data is often perceived as a strong limitation in the context of statistics and machine learning. In this study, we redeem this line of work in the case of generalized linear classification, a.k.a. the perceptron model, with random labels. We argue that t… ▽ More

    Submitted 2 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Journal ref: Physical Review E 109.3 (2024): 034305

  41. arXiv:2203.12094  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Learning curves for the multi-class teacher-student perceptron

    Authors: Elisabetta Cornacchia, Francesca Mignacco, Rodrigo Veiga, Cédric Gerbelot, Bruno Loureiro, Lenka Zdeborová

    Abstract: One of the most classical results in high-dimensional learning theory provides a closed-form expression for the generalisation error of binary classification with the single-layer teacher-student perceptron on i.i.d. Gaussian inputs. Both Bayes-optimal estimation and empirical risk minimisation (ERM) were extensively analysed for this setting. At the same time, a considerable part of modern machin… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 14 pages + appendix

    Journal ref: Machine Learning: Science and Technology 4 015019 (2022)

  42. arXiv:2203.07752  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT math.PR

    Optimal denoising of rotationally invariant rectangular matrices

    Authors: Emanuele Troiani, Vittorio Erba, Florent Krzakala, Antoine Maillard, Lenka Zdeborová

    Abstract: In this manuscript we consider denoising of large rectangular matrices: given a noisy observation of a signal matrix, what is the best way of recovering the signal matrix itself? For Gaussian noise and rotationally-invariant signal priors, we completely characterize the optimal denoiser and its performance in the high-dimensional limit, in which the size of the signal matrix goes to infinity with… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of Mathematical and Scientific Machine Learning (MSML), PMLR 190:97-112, 2022

  43. arXiv:2202.10379  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.DM math.PR

    (Dis)assortative Partitions on Random Regular Graphs

    Authors: Freya Behrens, Gabriel Arpino, Yaroslav Kivva, Lenka Zdeborová

    Abstract: We study the problem of assortative and disassortative partitions on random $d$-regular graphs. Nodes in the graph are partitioned into two non-empty groups. In the assortative partition every node requires at least $H$ of their neighbors to be in their own group. In the disassortative partition they require less than $H$ neighbors to be in their own group. Using the cavity method based on analysi… ▽ More

    Submitted 2 May, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: 21 pages; Corrected usage of the world "planted" in Section 4

    Journal ref: J. Phys. A: Math. Theor. 55 395004 (2022)

  44. arXiv:2202.03295  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Theoretical characterization of uncertainty in high-dimensional linear classification

    Authors: Lucas Clarté, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

    Abstract: Being able to reliably assess not only the \emph{accuracy} but also the \emph{uncertainty} of models' predictions is an important endeavour in modern machine learning. Even if the model generating the data and labels is known, computing the intrinsic uncertainty after learning the model from a limited number of samples amounts to sampling the corresponding posterior probability measure. Such sampl… ▽ More

    Submitted 14 November, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Journal ref: Mach. Learn.: Sci. Technol. 4 025029 (2023)

  45. arXiv:2202.00293  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Phase diagram of Stochastic Gradient Descent in high-dimensional two-layer neural networks

    Authors: Rodrigo Veiga, Ludovic Stephan, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

    Abstract: Despite the non-convex optimization landscape, over-parametrized shallow networks are able to achieve global convergence under gradient descent. The picture can be radically different for narrow networks, which tend to get stuck in badly-generalizing local minima. Here we investigate the cross-over between these two regimes in the high-dimensional setting, and in particular investigate the connect… ▽ More

    Submitted 14 June, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 20 pages

    Journal ref: Advances in Neural Information Processing Systems (2022), vol 35, pages {23244--23255)

  46. arXiv:2112.13079  [pdf, other

    cs.IT cond-mat.dis-nn cs.DM math.PR

    Aligning random graphs with a sub-tree similarity message-passing algorithm

    Authors: Giovanni Piccioli, Guilhem Semerjian, Gabriele Sicuro, Lenka Zdeborová

    Abstract: The problem of aligning Erdös-Rényi random graphs is a noisy, average-case version of the graph isomorphism problem, in which a pair of correlated random graphs is observed through a random permutation of their vertices. We study a polynomial time message-passing algorithm devised to solve the inference problem of partially recovering the hidden permutation, in the sparse regime with constant aver… ▽ More

    Submitted 4 May, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: 36 pages, 14 figures, submitted to Journal of Statistical Mechanics: Theory and Experiment. Corrected typos. Modified Figure 1 for clarity. Added references' titles in bibliography. Added definition of "quasi-aligned". Added clarifications about the significance of Nishimori experiments

    Journal ref: J. Stat. Mech. (2022) 063401

  47. arXiv:2110.08775  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT math.PR

    Perturbative construction of mean-field equations in extensive-rank matrix factorization and denoising

    Authors: Antoine Maillard, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Factorization of matrices where the rank of the two factors diverges linearly with their sizes has many applications in diverse areas such as unsupervised representation learning, dictionary learning or sparse coding. We consider a setting where the two factors are generated from known component-wise independent prior distributions, and the statistician observes a (possibly noisy) component-wise f… ▽ More

    Submitted 8 June, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: 30 pages (main text), 25 pages of references and appendices. v2: Adding clarifications and a new result to derive the optimal denoising estimator from the asymptotic free energy. v3: corrections to match the published version

    Journal ref: J. Stat. Mech. (2022) 083301

  48. Large Deviations of Semi-supervised Learning in the Stochastic Block Model

    Authors: Hugo Cui, Luca Saglietti, Lenka Zdeborová

    Abstract: In community detection on graphs, the semi-supervised learning problem entails inferring the ground-truth membership of each node in a graph, given the connectivity structure and a limited number of revealed node labels. Different subsets of revealed labels can in principle lead to higher or lower information gains and induce different reconstruction accuracies. In the framework of the dense stoch… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Journal ref: Phys. Rev. E 105, 034108 (2022)

  49. arXiv:2106.05418  [pdf, other

    cs.LG cond-mat.dis-nn

    Probing transfer learning with a model of synthetic correlated datasets

    Authors: Federica Gerace, Luca Saglietti, Stefano Sarao Mannelli, Andrew Saxe, Lenka Zdeborová

    Abstract: Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad-hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable… ▽ More

    Submitted 2 February, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Journal ref: Machine Learning: Science and Technology 3.1 (2022): 015030

  50. arXiv:2106.03791  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions

    Authors: Bruno Loureiro, Gabriele Sicuro, Cédric Gerbelot, Alessandro Pacco, Florent Krzakala, Lenka Zdeborová

    Abstract: Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks. In this manuscript, we characterise the learning of a mixture of $K$ Gaussians with generic means and covariances via empirical risk minimisation (ERM) with any convex loss and regularisation. In particular, we prove exact asymptotics characterising the ERM… ▽ More

    Submitted 14 December, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 12 pages + 34 pages of Appendix, 10 figures

    Journal ref: Advances in Neural Information Processing Systems 34 (2021): 10144-10157