Search | arXiv e-print repository

Information-theoretic reduction of deep neural networks to linear models in the overparametrized proportional regime

Authors: Francesco Camilli, Daria Tieplova, Eleonora Bergamin, Jean Barbier

Abstract: We rigorously analyse fully-trained neural networks of arbitrary depth in the Bayesian optimal setting in the so-called proportional scaling regime where the number of training samples and width of the input and all inner layers diverge proportionally. We prove an information-theoretic equivalence between the Bayesian deep neural network model trained from data generated by a teacher with matching… ▽ More We rigorously analyse fully-trained neural networks of arbitrary depth in the Bayesian optimal setting in the so-called proportional scaling regime where the number of training samples and width of the input and all inner layers diverge proportionally. We prove an information-theoretic equivalence between the Bayesian deep neural network model trained from data generated by a teacher with matching architecture, and a simpler model of optimal inference in a generalized linear model. This equivalence enables us to compute the optimal generalization error for deep neural networks in this regime. We thus prove the "deep Gaussian equivalence principle" conjectured in Cui et al. (2023) (arXiv:2302.00375). Our result highlights that in order to escape this "trivialisation" of deep neural networks (in the sense of reduction to a linear model) happening in the strongly overparametrized proportional regime, models trained from much more data have to be considered. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: Accepted to the 38th Annual Conference on Learning Theory (COLT 2025), 41 pages

arXiv:2501.13625 [pdf, other]

Information-theoretic limits and approximate message-passing for high-dimensional time series

Authors: Daria Tieplova, Samriddha Lahiry, Jean Barbier

Abstract: High-dimensional time series appear in many scientific setups, demanding a nuanced approach to model and analyze the underlying dependence structure. Theoretical advancements so far often rely on stringent assumptions regarding the sparsity of the underlying signal. In non-sparse regimes, analyses have primarily focused on linear regression models with the design matrix having independent rows. In… ▽ More High-dimensional time series appear in many scientific setups, demanding a nuanced approach to model and analyze the underlying dependence structure. Theoretical advancements so far often rely on stringent assumptions regarding the sparsity of the underlying signal. In non-sparse regimes, analyses have primarily focused on linear regression models with the design matrix having independent rows. In this paper, we expand the scope by investigating a high-dimensional time series model wherein the number of features grows proportionally to the number of sampling points, without assuming sparsity in the signal. Specifically, we consider the stochastic regression model and derive a single-letter formula for the normalized mutual information between observations and the signal, as well as for minimum mean-square errors. We also empirically study the vector approximate message passing VAMP algorithm and show that, despite the lack of theoretical guarantees, its performance for inference in our time series model is robust and often statistically optimal. △ Less

Submitted 19 March, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

MSC Class: 82Bxx; 62Mxx

arXiv:2405.20993 [pdf, other]

Information limits and Thouless-Anderson-Palmer equations for spiked matrix models with structured noise

Authors: Jean Barbier, Francesco Camilli, Marco Mondelli, Yizhou Xu

Abstract: We consider a prototypical problem of Bayesian inference for a structured spiked model: a low-rank signal is corrupted by additive noise. While both information-theoretic and algorithmic limits are well understood when the noise is a Gaussian Wigner matrix, the more realistic case of structured noise still proves to be challenging. To capture the structure while maintaining mathematical tractabili… ▽ More We consider a prototypical problem of Bayesian inference for a structured spiked model: a low-rank signal is corrupted by additive noise. While both information-theoretic and algorithmic limits are well understood when the noise is a Gaussian Wigner matrix, the more realistic case of structured noise still proves to be challenging. To capture the structure while maintaining mathematical tractability, a line of work has focused on rotationally invariant noise. However, existing studies either provide sub-optimal algorithms or are limited to special cases of noise ensembles. In this paper, using tools from statistical physics (replica method) and random matrix theory (generalized spherical integrals) we establish the first characterization of the information-theoretic limits for a noise matrix drawn from a general trace ensemble. Remarkably, our analysis unveils the asymptotic equivalence between the rotationally invariant model and a surrogate Gaussian one. Finally, we show how to saturate the predicted statistical limits using an efficient algorithm inspired by the theory of adaptive Thouless-Anderson-Palmer (TAP) equations. △ Less

Submitted 8 July, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

MSC Class: 62F15; 82B44

arXiv:2403.07189 [pdf, ps, other]

A multiscale cavity method for sublinear-rank symmetric matrix factorization

Authors: Jean Barbier, Justin Ko, Anas A. Rahman

Abstract: We consider a statistical model for symmetric matrix factorization with additive Gaussian noise in the high-dimensional regime where the rank $M$ of the signal matrix to infer scales with its size $N$ as $M={\rm o}(\sqrt{\ln N})$. Allowing for an $N$-dependent rank offers new challenges and requires new methods. Working in the Bayes-optimal setting, we show that whenever the signal has i.i.d.~entr… ▽ More We consider a statistical model for symmetric matrix factorization with additive Gaussian noise in the high-dimensional regime where the rank $M$ of the signal matrix to infer scales with its size $N$ as $M={\rm o}(\sqrt{\ln N})$. Allowing for an $N$-dependent rank offers new challenges and requires new methods. Working in the Bayes-optimal setting, we show that whenever the signal has i.i.d.~entries, the limiting mutual information between signal and data is given by a variational formula involving a rank-one replica symmetric potential. In other words, from the information-theoretic perspective, the case of a (slowly) growing rank is the same as when $M=1$ (namely, the standard spiked Wigner model). The proof is primarily based on a novel multiscale cavity method allowing for growing rank along with some information-theoretic identities on worst noise for the vector Gaussian channel. We believe that the cavity method developed here will play a role in the analysis of a broader class of inference and spin models where the degrees of freedom are large arrays instead of vectors. △ Less

Submitted 20 March, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: 64 pages. Filled out proof details, with one step being more involved than initially thought and resulting in changes to the main theorem

arXiv:2307.05635 [pdf, ps, other]

Fundamental limits of overparametrized shallow neural networks for supervised learning

Authors: Francesco Camilli, Daria Tieplova, Jean Barbier

Abstract: We carry out an information-theoretical analysis of a two-layer neural network trained from input-output pairs generated by a teacher network with matching architecture, in overparametrized regimes. Our results come in the form of bounds relating i) the mutual information between training data and network weights, or ii) the Bayes-optimal generalization error, to the same quantities but for a simp… ▽ More We carry out an information-theoretical analysis of a two-layer neural network trained from input-output pairs generated by a teacher network with matching architecture, in overparametrized regimes. Our results come in the form of bounds relating i) the mutual information between training data and network weights, or ii) the Bayes-optimal generalization error, to the same quantities but for a simpler (generalized) linear model for which explicit expressions are rigorously known. Our bounds, which are expressed in terms of the number of training samples, input dimension and number of hidden units, thus yield fundamental performance limits for any neural network (and actually any learning procedure) trained from limited data generated according to our two-layer teacher neural network model. The proof relies on rigorous tools from spin glasses and is guided by ``Gaussian equivalence principles'' lying at the core of numerous recent analyses of neural networks. With respect to the existing literature, which is either non-rigorous or restricted to the case of the learning of the readout weights only, our results are information-theoretic (i.e. are not specific to any learning algorithm) and, importantly, cover a setting where all the network parameters are trained. △ Less

Submitted 11 July, 2023; originally announced July 2023.

Comments: 30 pages, 1 figure

MSC Class: 68Txx; 68T07

arXiv:2302.03306 [pdf, other]

Mismatched estimation of non-symmetric rank-one matrices corrupted by structured noise

Authors: Teng Fu, YuHao Liu, Jean Barbier, Marco Mondelli, ShanSuo Liang, TianQi Hou

Abstract: We study the performance of a Bayesian statistician who estimates a rank-one signal corrupted by non-symmetric rotationally invariant noise with a generic distribution of singular values. As the signal-to-noise ratio and the noise structure are unknown, a Gaussian setup is incorrectly assumed. We derive the exact analytic expression for the error of the mismatched Bayes estimator and also provide… ▽ More We study the performance of a Bayesian statistician who estimates a rank-one signal corrupted by non-symmetric rotationally invariant noise with a generic distribution of singular values. As the signal-to-noise ratio and the noise structure are unknown, a Gaussian setup is incorrectly assumed. We derive the exact analytic expression for the error of the mismatched Bayes estimator and also provide the analysis of an approximate message passing (AMP) algorithm. The first result exploits the asymptotic behavior of spherical integrals for rectangular matrices and of low-rank matrix perturbations; the second one relies on the design and analysis of an auxiliary AMP. The numerical experiments show that there is a performance gap between the AMP and Bayes estimators, which is due to the incorrect estimation of the signal norm. △ Less

Submitted 8 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

arXiv:2205.10009 [pdf, other]

The price of ignorance: how much does it cost to forget noise structure in low-rank matrix estimation?

Authors: Jean Barbier, TianQi Hou, Marco Mondelli, Manuel Sáenz

Abstract: We consider the problem of estimating a rank-1 signal corrupted by structured rotationally invariant noise, and address the following question: how well do inference algorithms perform when the noise statistics is unknown and hence Gaussian noise is assumed? While the matched Bayes-optimal setting with unstructured noise is well understood, the analysis of this mismatched problem is only at its pr… ▽ More We consider the problem of estimating a rank-1 signal corrupted by structured rotationally invariant noise, and address the following question: how well do inference algorithms perform when the noise statistics is unknown and hence Gaussian noise is assumed? While the matched Bayes-optimal setting with unstructured noise is well understood, the analysis of this mismatched problem is only at its premises. In this paper, we make a step towards understanding the effect of the strong source of mismatch which is the noise statistics. Our main technical contribution is the rigorous analysis of a Bayes estimator and of an approximate message passing (AMP) algorithm, both of which incorrectly assume a Gaussian setup. The first result exploits the theory of spherical integrals and of low-rank matrix perturbations; the idea behind the second one is to design and analyze an artificial AMP which, by taking advantage of the flexibility in the denoisers, is able to "correct" the mismatch. Armed with these sharp asymptotic characterizations, we unveil a rich and often unexpected phenomenology. For example, despite AMP is in principle designed to efficiently compute the Bayes estimator, the former is outperformed by the latter in terms of mean-square error. We show that this performance gap is due to an incorrect estimation of the signal norm. In fact, when the SNR is large enough, the overlaps of the AMP and the Bayes estimator coincide, and they even match those of optimal estimators taking into account the structure of the noise. △ Less

Submitted 20 May, 2022; originally announced May 2022.

arXiv:2112.02066 [pdf, ps, other]

Marginals of a spherical spin glass model with correlated disorder

Authors: Jean Barbier, Manuel Sáenz

Abstract: In this paper we prove the weak convergence, in a high-temperature phase, of the finite marginals of the Gibbs measure associated to a symmetric spherical spin glass model with correlated couplings towards an explicit asymptotic decoupled measure. We also provide upper bounds for the rate of convergence in terms of the one of the energy per variable. Furthermore, we establish a concentration inequ… ▽ More In this paper we prove the weak convergence, in a high-temperature phase, of the finite marginals of the Gibbs measure associated to a symmetric spherical spin glass model with correlated couplings towards an explicit asymptotic decoupled measure. We also provide upper bounds for the rate of convergence in terms of the one of the energy per variable. Furthermore, we establish a concentration inequality for bounded functions under a higher temperature condition. These results are exemplified by analysing the asymptotic behaviour of the empirical mean of coordinate-wise functions of samples from the Gibbs measure of the model. △ Less

Submitted 3 December, 2021; originally announced December 2021.

arXiv:2109.06610 [pdf, other]

doi 10.1103/PhysRevE.106.024136

Statistical limits of dictionary learning: random matrix theory and the spectral replica method

Authors: Jean Barbier, Nicolas Macris

Abstract: We consider increasingly complex models of matrix denoising and dictionary learning in the Bayes-optimal setting, in the challenging regime where the matrices to infer have a rank growing linearly with the system size. This is in contrast with most existing literature concerned with the low-rank (i.e., constant-rank) regime. We first consider a class of rotationally invariant matrix denoising prob… ▽ More We consider increasingly complex models of matrix denoising and dictionary learning in the Bayes-optimal setting, in the challenging regime where the matrices to infer have a rank growing linearly with the system size. This is in contrast with most existing literature concerned with the low-rank (i.e., constant-rank) regime. We first consider a class of rotationally invariant matrix denoising problems whose mutual information and minimum mean-square error are computable using techniques from random matrix theory. Next, we analyze the more challenging models of dictionary learning. To do so we introduce a novel combination of the replica method from statistical mechanics together with random matrix theory, coined spectral replica method. This allows us to derive variational formulas for the mutual information between hidden representations and the noisy data of the dictionary learning problem, as well as for the overlaps quantifying the optimal reconstruction error. The proposed method reduces the number of degrees of freedom from $Θ(N^2)$ matrix entries to $Θ(N)$ eigenvalues (or singular values), and yields Coulomb gas representations of the mutual information which are reminiscent of matrix models in physics. The main ingredients are a combination of large deviation results for random matrices together with a new replica symmetric decoupling ansatz at the level of the probability distributions of eigenvalues (or singular values) of certain overlap matrices and the use of HarishChandra-Itzykson-Zuber spherical integrals. △ Less

Submitted 26 February, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

arXiv:2107.06936 [pdf, ps, other]

Performance of Bayesian linear regression in a model with mismatch

Authors: Jean Barbier, Wei-Kuo Chen, Dmitry Panchenko, Manuel Sáenz

Abstract: In this paper we analyze, for a model of linear regression with gaussian covariates, the performance of a Bayesian estimator given by the mean of a log-concave posterior distribution with gaussian prior, in the high-dimensional limit where the number of samples and the covariates' dimension are large and proportional. Although the high-dimensional analysis of Bayesian estimators has been previousl… ▽ More In this paper we analyze, for a model of linear regression with gaussian covariates, the performance of a Bayesian estimator given by the mean of a log-concave posterior distribution with gaussian prior, in the high-dimensional limit where the number of samples and the covariates' dimension are large and proportional. Although the high-dimensional analysis of Bayesian estimators has been previously studied for Bayesian-optimal linear regression where the correct posterior is used for inference, much less is known when there is a mismatch. Here we consider a model in which the responses are corrupted by gaussian noise and are known to be generated as linear combinations of the covariates, but the distributions of the ground-truth regression coefficients and of the noise are unknown. This regression task can be rephrased as a statistical mechanics model known as the Gardner spin glass, an analogy which we exploit. Using a leave-one-out approach we characterize the mean-square error for the regression coefficients. We also derive the log-normalizing constant of the posterior. Similar models have been studied by Shcherbina and Tirozzi and by Talagrand, but our arguments are much more straightforward. An interesting consequence of our analysis is that in the quadratic loss case, the performance of the Bayesian estimator is independent of a global "temperature" hyperparameter and matches the ridge estimator: sampling and optimizing are equally good. △ Less

Submitted 10 November, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

arXiv:2009.12939 [pdf, ps, other]

doi 10.1093/imaiai/iaab027

Strong replica symmetry for high-dimensional disordered log-concave Gibbs measures

Authors: Jean Barbier, Dmitry Panchenko, Manuel Sáenz

Abstract: We consider a generic class of log-concave, possibly random, (Gibbs) measures. We prove the concentration of an infinite family of order parameters called multioverlaps. Because they completely parametrise the quenched Gibbs measure of the system, this implies a simple representation of the asymptotic Gibbs measures, as well as the decoupling of the variables in a strong sense. These results may p… ▽ More We consider a generic class of log-concave, possibly random, (Gibbs) measures. We prove the concentration of an infinite family of order parameters called multioverlaps. Because they completely parametrise the quenched Gibbs measure of the system, this implies a simple representation of the asymptotic Gibbs measures, as well as the decoupling of the variables in a strong sense. These results may prove themselves useful in several contexts. In particular in machine learning and high-dimensional inference, log-concave measures appear in convex empirical risk minimisation, maximum a-posteriori inference or M-estimation. We believe that they may be applicable in establishing some type of "replica symmetric formulas" for the free energy, inference or generalisation error in such settings. △ Less

Submitted 22 February, 2022; v1 submitted 27 September, 2020; originally announced September 2020.

Journal ref: Inf. Inference, 11, no. 3 (2022) 1079-1108

arXiv:2006.07971 [pdf, other]

All-or-nothing statistical and computational phase transitions in sparse spiked matrix estimation

Authors: Jean Barbier, Nicolas Macris, Cynthia Rush

Abstract: We determine statistical and computational limits for estimation of a rank-one matrix (the spike) corrupted by an additive gaussian noise matrix, in a sparse limit, where the underlying hidden vector (that constructs the rank-one matrix) has a number of non-zero components that scales sub-linearly with the total dimension of the vector, and the signal-to-noise ratio tends to infinity at an appropr… ▽ More We determine statistical and computational limits for estimation of a rank-one matrix (the spike) corrupted by an additive gaussian noise matrix, in a sparse limit, where the underlying hidden vector (that constructs the rank-one matrix) has a number of non-zero components that scales sub-linearly with the total dimension of the vector, and the signal-to-noise ratio tends to infinity at an appropriate speed. We prove explicit low-dimensional variational formulas for the asymptotic mutual information between the spike and the observed noisy matrix and analyze the approximate message passing algorithm in the sparse regime. For Bernoulli and Bernoulli-Rademacher distributed vectors, and when the sparsity and signal strength satisfy an appropriate scaling relation, we find all-or-nothing phase transitions for the asymptotic minimum and algorithmic mean-square errors. These jump from their maximum possible value to zero, at well defined signal-to-noise thresholds whose asymptotic values we determine exactly. In the asymptotic regime the statistical-to-algorithmic gap diverges indicating that sparse recovery is hard for approximate message passing. △ Less

Submitted 30 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

Comments: Part of this work (in particular the proof of Theorem 1) is already present in reference arXiv:1911.05030

arXiv:2005.03115 [pdf, other]

Strong replica symmetry in high-dimensional optimal Bayesian inference

Authors: Jean Barbier, Dmitry Panchenko

Abstract: We consider generic optimal Bayesian inference, namely, models of signal reconstruction where the posterior distribution and all hyperparameters are known. Under a standard assumption on the concentration of the free energy, we show how replica symmetry in the strong sense of concentration of all multioverlaps can be established as a consequence of the Franz-de Sanctis identities; the identities t… ▽ More We consider generic optimal Bayesian inference, namely, models of signal reconstruction where the posterior distribution and all hyperparameters are known. Under a standard assumption on the concentration of the free energy, we show how replica symmetry in the strong sense of concentration of all multioverlaps can be established as a consequence of the Franz-de Sanctis identities; the identities themselves in the current setting are obtained via a novel perturbation coming from exponentially distributed "side-observations" of the signal. Concentration of multioverlaps means that asymptotically the posterior distribution has a particularly simple structure encoded by a random probability measure (or, in the case of binary signal, a non-random probability measure). We believe that such strong control of the model should be key in the study of inference problems with underlying sparse graphical structure (error correcting codes, block models, etc) and, in particular, in the rigorous derivation of replica symmetric formulas for the free energy and mutual information in this context. △ Less

Submitted 22 February, 2022; v1 submitted 6 May, 2020; originally announced May 2020.

Journal ref: Communications in Mathematical Physics 393, no. 3 (2022) 1199-1239

arXiv:2004.06975 [pdf, ps, other]

doi 10.1109/ISIT44484.2020.9174104

High-dimensional rank-one nonsymmetric matrix decomposition: the spherical case

Authors: Clément Luneau, Nicolas Macris, Jean Barbier

Abstract: We consider the problem of estimating a rank-one nonsymmetric matrix under additive white Gaussian noise. The matrix to estimate can be written as the outer product of two vectors and we look at the special case in which both vectors are uniformly distributed on spheres. We prove a replica-symmetric formula for the average mutual information between these vectors and the observations in the high-d… ▽ More We consider the problem of estimating a rank-one nonsymmetric matrix under additive white Gaussian noise. The matrix to estimate can be written as the outer product of two vectors and we look at the special case in which both vectors are uniformly distributed on spheres. We prove a replica-symmetric formula for the average mutual information between these vectors and the observations in the high-dimensional regime. This goes beyond previous results which considered vectors with independent and identically distributed elements. The method used can be extended to rank-one tensor problems. △ Less

Submitted 15 April, 2020; originally announced April 2020.

Comments: Will appear in 2020 IEEE International Symposium on Information Theory (ISIT). Long version with appendices, 26 pages

arXiv:1911.05030 [pdf, other]

0-1 phase transitions in sparse spiked matrix estimation

Authors: Jean Barbier, Nicolas Macris

Abstract: We consider statistical models of estimation of a rank-one matrix (the spike) corrupted by an additive gaussian noise matrix in the sparse limit. In this limit the underlying hidden vector (that constructs the rank-one matrix) has a number of non-zero components that scales sub-linearly with the total dimension of the vector, and the signal strength tends to infinity at an appropriate speed. We pr… ▽ More We consider statistical models of estimation of a rank-one matrix (the spike) corrupted by an additive gaussian noise matrix in the sparse limit. In this limit the underlying hidden vector (that constructs the rank-one matrix) has a number of non-zero components that scales sub-linearly with the total dimension of the vector, and the signal strength tends to infinity at an appropriate speed. We prove explicit low-dimensional variational formulas for the asymptotic mutual information between the spike and the observed noisy matrix in suitable sparse limits. For Bernoulli and Bernoulli-Rademacher distributed vectors, and when the sparsity and signal strength satisfy an appropriate scaling relation, these formulas imply sharp 0-1 phase transitions for the asymptotic minimum mean-square-error. A similar phase transition was analyzed recently in the context of sparse high-dimensional linear regression (compressive sensing). △ Less

Submitted 12 November, 2019; originally announced November 2019.

arXiv:1907.07103 [pdf, ps, other]

Concentration of the matrix-valued minimum mean-square error in optimal Bayesian inference

Authors: Jean Barbier

Abstract: We consider Bayesian inference of signals with vector-valued entries. Extending concentration techniques from the mathematical physics of spin glasses, we show that the matrix-valued minimum mean-square error concentrates when the size of the problem increases. Such results are often crucial for proving single-letter formulas for the mutual information when they exist. Our proof is valid in the op… ▽ More We consider Bayesian inference of signals with vector-valued entries. Extending concentration techniques from the mathematical physics of spin glasses, we show that the matrix-valued minimum mean-square error concentrates when the size of the problem increases. Such results are often crucial for proving single-letter formulas for the mutual information when they exist. Our proof is valid in the optimal Bayesian inference setting, meaning that it relies on the assumption that the model and all its hyper-parameters are known. Examples of inference and learning problems covered by our results are spiked matrix and tensor models, the committee machine neural network with few hidden neurons in the teacher-student scenario, or multi-layers generalized linear models. △ Less

Submitted 15 July, 2019; originally announced July 2019.

Comments: arXiv admin note: text overlap with arXiv:1904.02808

arXiv:1904.02808 [pdf, other]

Overlap matrix concentration in optimal Bayesian inference

Authors: Jean Barbier

Abstract: We consider models of Bayesian inference of signals with vectorial components of finite dimensionality. We show that, under a proper perturbation, these models are replica symmetric in the sense that the overlap matrix concentrates. The overlap matrix is the order parameter in these models and is directly related to error metrics such as minimum mean-square errors. Our proof is valid in the optima… ▽ More We consider models of Bayesian inference of signals with vectorial components of finite dimensionality. We show that, under a proper perturbation, these models are replica symmetric in the sense that the overlap matrix concentrates. The overlap matrix is the order parameter in these models and is directly related to error metrics such as minimum mean-square errors. Our proof is valid in the optimal Bayesian inference setting. This means that it relies on the assumption that the model and all its hyper-parameters are known so that the posterior distribution can be written exactly. Examples of important problems in high-dimensional inference and learning to which our results apply are low-rank tensor factorization, the committee machine neural network with a finite number of hidden neurons in the teacher-student scenario, or multi-layer versions of the generalized linear model. △ Less

Submitted 24 January, 2020; v1 submitted 4 April, 2019; originally announced April 2019.

arXiv:1902.07273 [pdf, other]

Mutual Information for the Stochastic Block Model by the Adaptive Interpolation Method

Authors: Jean Barbier, Chun Lam Chan, Nicolas Macris

Abstract: We rigorously derive a single-letter variational expression for the mutual information of the asymmetric two-groups stochastic block model in the dense graph regime. Existing proofs in the literature are indirect, as they involve mapping the model to a rank-one matrix estimation problem whose mutual information is then determined by a combination of methods (e.g., interpolation, cavity, algorithmi… ▽ More We rigorously derive a single-letter variational expression for the mutual information of the asymmetric two-groups stochastic block model in the dense graph regime. Existing proofs in the literature are indirect, as they involve mapping the model to a rank-one matrix estimation problem whose mutual information is then determined by a combination of methods (e.g., interpolation, cavity, algorithmic, spatial coupling). In this contribution we provide a self-contained direct method using only the recently introduced adaptive interpolation method. △ Less

Submitted 16 July, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

arXiv:1901.06521 [pdf, other]

doi 10.1007/s10955-019-02470-6

Concentration of multi-overlaps for random ferromagnetic spin models

Authors: Jean Barbier, Chun Lam Chan, Nicolas Macris

Abstract: We consider ferromagnetic spin models on dilute random graphs and prove that, with suitable one-body infinitesimal perturbations added to the Hamiltonian, the multi-overlaps concentrate for all temperatures, both with respect to the thermal Gibbs average and the quenched randomness. Results of this nature have been known only for the lowest order overlaps, at high temperature or on the Nishimori l… ▽ More We consider ferromagnetic spin models on dilute random graphs and prove that, with suitable one-body infinitesimal perturbations added to the Hamiltonian, the multi-overlaps concentrate for all temperatures, both with respect to the thermal Gibbs average and the quenched randomness. Results of this nature have been known only for the lowest order overlaps, at high temperature or on the Nishimori line. Here we treat all multi-overlaps by a non-trivial application of Griffiths-Kelly-Sherman correlation inequalities. Our results apply in particular to the pure and mixed p-spin ferromagnets on random dilute Erdoes-Rényi hypergraphs. On physical grounds one expects that multi-overlap concentration directly implies the correctness of the cavity (or replica symmetric) formula for the pressure. The proof of this formula for the general p-spin ferromagnet on a random dilute hypergraph remains an open problem. △ Less

Submitted 19 January, 2019; originally announced January 2019.

arXiv:1901.06516 [pdf, ps, other]

doi 10.1088/1751-8121/ab2735

The adaptive interpolation method for proving replica formulas. Applications to the Curie-Weiss and Wigner spike models

Authors: Jean Barbier, Nicolas Macris

Abstract: In this contribution we give a pedagogic introduction to the newly introduced adaptive interpolation method to prove in a simple and unified way replica formulas for Bayesian optimal inference problems. Many aspects of this method can already be explained at the level of the simple Curie-Weiss spin system. This provides a new method of solution for this model which does not appear to be known. We… ▽ More In this contribution we give a pedagogic introduction to the newly introduced adaptive interpolation method to prove in a simple and unified way replica formulas for Bayesian optimal inference problems. Many aspects of this method can already be explained at the level of the simple Curie-Weiss spin system. This provides a new method of solution for this model which does not appear to be known. We then generalize this analysis to a paradigmatic inference problem, namely rank-one matrix estimation, also refered to as the Wigner spike model in statistics. We give many pointers to the recent literature where the method has been succesfully applied. △ Less

Submitted 7 March, 2020; v1 submitted 19 January, 2019; originally announced January 2019.

arXiv:1401.1700 [pdf, ps, other]

A Corollary of Hamada and Ohmori's on Group Law over BIBD

Authors: Arnaud Bannier, Johann Barbier, Eric Filiol, Pierre Castel

Abstract: In this note, we present an interesting corollary of a theorem of Hamada and Ohmori. We prove that the complementary of PG(n,2) is the only design, up to an isomorphism, whose blocks form a group for the symmetric difference. In this note, we present an interesting corollary of a theorem of Hamada and Ohmori. We prove that the complementary of PG(n,2) is the only design, up to an isomorphism, whose blocks form a group for the symmetric difference. △ Less

Submitted 8 January, 2014; originally announced January 2014.

Comments: 4 pages

arXiv:1207.2079 [pdf, other]

doi 10.1109/Allerton.2012.6483300

Compressed Sensing of Approximately-Sparse Signals: Phase Transitions and Optimal Reconstruction

Authors: Jean Barbier, Florent Krzakala, Marc Mézard, Lenka Zdeborová

Abstract: Compressed sensing is designed to measure sparse signals directly in a compressed form. However, most signals of interest are only "approximately sparse", i.e. even though the signal contains only a small fraction of relevant (large) components the other components are not strictly equal to zero, but are only close to zero. In this paper we model the approximately sparse signal with a Gaussian dis… ▽ More Compressed sensing is designed to measure sparse signals directly in a compressed form. However, most signals of interest are only "approximately sparse", i.e. even though the signal contains only a small fraction of relevant (large) components the other components are not strictly equal to zero, but are only close to zero. In this paper we model the approximately sparse signal with a Gaussian distribution of small components, and we study its compressed sensing with dense random matrices. We use replica calculations to determine the mean-squared error of the Bayes-optimal reconstruction for such signals, as a function of the variance of the small components, the density of large components and the measurement rate. We then use the G-AMP algorithm and we quantify the region of parameters for which this algorithm achieves optimality (for large systems). Finally, we show that in the region where the GAMP for the homogeneous measurement matrices is not optimal, a special "seeding" design of a spatially-coupled measurement matrix allows to restore optimality. △ Less

Submitted 9 July, 2012; originally announced July 2012.

Comments: 8 pages, 10 figures

Journal ref: Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on , pp.800,807, 1-5 Oct. 2012

Showing 1–22 of 22 results for author: Barbier, J