Skip to main content

Showing 1–44 of 44 results for author: Mezard, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17638  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training

    Authors: Tony Bonnaire, Raphaël Urfin, Giulio Biroli, Marc Mézard

    Abstract: Diffusion models have achieved remarkable success across a wide range of generative tasks. A key challenge is understanding the mechanisms that prevent their memorization of training data and allow generalization. In this work, we investigate the role of the training dynamics in the transition from generalization to memorization. Through extensive experiments and theoretical analysis, we identify… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 36 pages, 15 figures

  2. arXiv:2502.07849  [pdf, other

    cs.LG cs.AI stat.ML

    Classifier-Free Guidance: From High-Dimensional Analysis to Generalized Guidance Forms

    Authors: Krunoslav Lehman Pavasovic, Jakob Verbeek, Giulio Biroli, Marc Mezard

    Abstract: Classifier-Free Guidance (CFG) is a widely adopted technique in diffusion and flow-based generative models, enabling high-quality conditional generation. A key theoretical challenge is characterizing the distribution induced by CFG, particularly in high-dimensional settings relevant to real-world data. Previous works have shown that CFG modifies the target distribution, steering it towards a distr… ▽ More

    Submitted 22 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  3. arXiv:2501.00988  [pdf, other

    cs.LG

    Optimizing Noise Schedules of Generative Models in High Dimensionss

    Authors: Santiago Aranguri, Giulio Biroli, Marc Mezard, Eric Vanden-Eijnden

    Abstract: Recent works have shown that diffusion models can undergo phase transitions, the resolution of which is needed for accurately generating samples. This has motivated the use of different noise schedules, the two most common choices being referred to as variance preserving (VP) and variance exploding (VE). Here we revisit these schedules within the framework of stochastic interpolants. Using the Gau… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

  4. arXiv:2408.15138  [pdf, ps, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech cs.CL

    How transformers learn structured data: insights from hierarchical filtering

    Authors: Jerome Garnier-Brun, Marc Mézard, Emanuele Moscato, Luca Saglietti

    Abstract: Understanding the learning process and the embedded computation in transformers is becoming a central goal for the development of interpretable AI. In the present study, we introduce a hierarchical filtering procedure for data models of sequences on trees, allowing us to hand-tune the range of positional correlations in the data. Leveraging this controlled setting, we provide evidence that vanilla… ▽ More

    Submitted 10 June, 2025; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: 17 pages, 12 figures

  5. arXiv:2408.05807  [pdf, other

    cs.LG cond-mat.dis-nn math.ST stat.ML

    Kernel Density Estimators in Large Dimensions

    Authors: Giulio Biroli, Marc Mézard

    Abstract: This paper studies Kernel Density Estimation for a high-dimensional distribution $ρ(x)$. Traditional approaches have focused on the limit of large number of data points $n$ and fixed dimension $d$. We analyze instead the regime where both the number $n$ of data points $y_i$ and their dimensionality $d$ grow with a fixed ratio $α=(\log n)/d$. Our study reveals three distinct statistical regimes for… ▽ More

    Submitted 18 October, 2024; v1 submitted 11 August, 2024; originally announced August 2024.

  6. arXiv:2402.18491  [pdf, other

    cs.LG cond-mat.stat-mech

    Dynamical Regimes of Diffusion Models

    Authors: Giulio Biroli, Tony Bonnaire, Valentin de Bortoli, Marc Mézard

    Abstract: Using statistical physics methods, we study generative diffusion models in the regime where the dimension of space and the number of data are large, and the score function has been trained optimally. Our analysis reveals three distinct dynamical regimes during the backward generative diffusion process. The generative dynamics, starting from pure noise, encounters first a 'speciation' transition wh… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 22 pages, 11 figures

    Journal ref: Nature Communications 15, 9957 (2024)

  7. arXiv:2308.13445  [pdf, other

    cond-mat.dis-nn cs.NE

    Eigenvector Dreaming

    Authors: Marco Benedetti, Louis Carillo, Enzo Marinari, Marc Mèzard

    Abstract: Among the performance-enhancing procedures for Hopfield-type networks that implement associative memory, Hebbian Unlearning (or dreaming) strikes for its simplicity and its clear biological interpretation. Yet, it does not easily lend itself to a clear analytical understanding. Here we show how Hebbian Unlearning can be effectively described in terms of a simple evolution of the spectrum and the e… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  8. arXiv:2307.16564  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT cs.LG

    The Decimation Scheme for Symmetric Matrix Factorization

    Authors: Francesco Camilli, Marc Mézard

    Abstract: Matrix factorization is an inference problem that has acquired importance due to its vast range of applications that go from dictionary learning to recommendation systems and machine learning with deep networks. The study of its fundamental statistical limits represents a true challenge, and despite a decade-long history of efforts in the community, there is still no closed formula able to describ… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 30 pages, 13 figures

  9. arXiv:2306.16097  [pdf, other

    cond-mat.stat-mech cs.IT cs.LG stat.ML

    Sparse Representations, Inference and Learning

    Authors: Clarissa Lauditi, Emanuele Troiani, Marc Mézard

    Abstract: In recent years statistical physics has proven to be a valuable tool to probe into large dimensional inference problems such as the ones occurring in machine learning. Statistical physics provides analytical tools to study fundamental limitations in their solutions and proposes algorithms to solve individual instances. In these notes, based on the lectures by Marc Mézard in 2022 at the summer scho… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  10. arXiv:2304.14964  [pdf, other

    cond-mat.dis-nn cs.IT

    The Exponential Capacity of Dense Associative Memories

    Authors: Carlo Lucibello, Marc Mézard

    Abstract: Recent generalizations of the Hopfield model of associative memories are able to store a number $P$ of random patterns that grows exponentially with the number $N$ of neurons, $P=\exp(αN)$. Besides the huge storage capacity, another interesting feature of these networks is their connection to the attention mechanism which is part of the Transformer architectures widely applied in deep learning. In… ▽ More

    Submitted 22 January, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Version accepted on Physics Review Letters

    Journal ref: Phys. Rev. Lett. 132 (2024)

  11. arXiv:2212.02105  [pdf, other

    cond-mat.dis-nn cs.LG

    Matrix factorization with neural networks

    Authors: Francesco Camilli, Marc Mézard

    Abstract: Matrix factorization is an important mathematical problem encountered in the context of dictionary learning, recommendation systems and machine learning. We introduce a new `decimation' scheme that maps it to neural network models of associative memory and provide a detailed theoretical analysis of its performance, showing that decimation is able to factorize extensive-rank matrices and to denoise… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: 13 pages, 6 figures

  12. arXiv:2110.08775  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT math.PR

    Perturbative construction of mean-field equations in extensive-rank matrix factorization and denoising

    Authors: Antoine Maillard, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Factorization of matrices where the rank of the two factors diverges linearly with their sizes has many applications in diverse areas such as unsupervised representation learning, dictionary learning or sparse coding. We consider a setting where the two factors are generated from known component-wise independent prior distributions, and the statistician observes a (possibly noisy) component-wise f… ▽ More

    Submitted 8 June, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: 30 pages (main text), 25 pages of references and appendices. v2: Adding clarifications and a new result to derive the optimal denoising estimator from the asymptotic free energy. v3: corrections to match the published version

    Journal ref: J. Stat. Mech. (2022) 083301

  13. arXiv:2102.08127  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.PR math.ST

    Learning curves of generic features maps for realistic datasets with a teacher-student model

    Authors: Bruno Loureiro, Cédric Gerbelot, Hugo Cui, Sebastian Goldt, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Teacher-student models provide a framework in which the typical-case performance of high-dimensional supervised learning can be described in closed form. The assumptions of Gaussian i.i.d. input data underlying the canonical teacher-student model may, however, be perceived as too restrictive to capture the behaviour of realistic data sets. In this paper, we introduce a Gaussian covariate generalis… ▽ More

    Submitted 14 December, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: v3: NeurIPS camera-ready

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), vol 34 p10137--18151. J. Stat. Mech. (2022) 114001

  14. arXiv:2009.09422  [pdf, other

    q-bio.PE cond-mat.stat-mech cs.AI cs.LG

    Epidemic mitigation by statistical inference from contact tracing data

    Authors: Antoine Baker, Indaco Biazzo, Alfredo Braunstein, Giovanni Catania, Luca Dall'Asta, Alessandro Ingrosso, Florent Krzakala, Fabio Mazza, Marc Mézard, Anna Paola Muntoni, Maria Refinetti, Stefano Sarao Mannelli, Lenka Zdeborová

    Abstract: Contact-tracing is an essential tool in order to mitigate the impact of pandemic such as the COVID-19. In order to achieve efficient and scalable contact-tracing in real time, digital devices can play an important role. While a lot of attention has been paid to analyzing the privacy and ethical risks of the associated mobile applications, so far much less research has been devoted to optimizing th… ▽ More

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: 21 pages, 7 figures

    ACM Class: G.3; G.4; I.2.11; J.3

    Journal ref: PNAS 2021 Vol. 118 No. 32 e2106548118

  15. arXiv:2006.14709  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    The Gaussian equivalence of generative models for learning with shallow neural networks

    Authors: Sebastian Goldt, Bruno Loureiro, Galen Reeves, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Understanding the impact of data structure on the computational tractability of learning is a key challenge for the theory of neural networks. Many theoretical works do not explicitly model training data, or assume that inputs are drawn component-wise independently from some simple probability distribution. Here, we go beyond this simple paradigm by studying the performance of neural networks trai… ▽ More

    Submitted 21 May, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: The accompanying code for this paper is available at https://github.com/sgoldt/gaussian-equiv-2layer

    Journal ref: Proceedings of the 2nd Mathematical and Scientific Machine Learning Conference, PMLR 145:426-471 (2021)

  16. arXiv:2002.09339  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Generalisation error in learning with random features and the hidden manifold model

    Authors: Federica Gerace, Bruno Loureiro, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden manifold model. We consider the high-dimensional regime and using the replica method from statistical physics, we provide a closed-form expression for the asymp… ▽ More

    Submitted 20 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: v2: ICML 2020 camera-ready

    Journal ref: J. Stat. Mech. 2021 124013 & ICML 2020

  17. arXiv:1909.11500  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Modelling the influence of data structure on learning in neural networks: the hidden manifold model

    Authors: Sebastian Goldt, Marc Mézard, Florent Krzakala, Lenka Zdeborová

    Abstract: Understanding the reasons for the success of deep neural networks trained using stochastic gradient-based methods is a key open problem for the nascent theory of deep learning. The types of data where these networks are most successful, such as images or sequences of speech, are characterised by intricate correlations. Yet, most theoretical work on neural networks does not explicitly model trainin… ▽ More

    Submitted 3 December, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Journal ref: Physical Review X, Vol. 10, No. 4 (2020)

  18. arXiv:1906.08479  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT math.PR

    High-temperature Expansions and Message Passing Algorithms

    Authors: Antoine Maillard, Laura Foini, Alejandro Lage Castellanos, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Improved mean-field technics are a central theme of statistical physics methods applied to inference and learning. We revisit here some of these methods using high-temperature expansions for disordered systems initiated by Plefka, Georges and Yedidia. We derive the Gibbs free entropy and the subsequent self-consistent equations for a generic class of statistical models with correlated matrices and… ▽ More

    Submitted 10 June, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 59 pages, updated version matching the version published in J. Stat. Mech. Correction of typos in the last version

    Journal ref: J. Stat. Mech. (2019) 113301

  19. arXiv:1701.06981  [pdf, other

    cs.IT cond-mat.stat-mech stat.ML

    Multi-Layer Generalized Linear Estimation

    Authors: Andre Manoel, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: We consider the problem of reconstructing a signal from multi-layered (possibly) non-linear measurements. Using non-rigorous but standard methods from statistical physics we present the Multi-Layer Approximate Message Passing (ML-AMP) algorithm for computing marginal probabilities of the corresponding estimation problem and derive the associated state evolution equations to analyze its performance… ▽ More

    Submitted 24 January, 2017; originally announced January 2017.

    Comments: 5 pages, 1 figure

    Journal ref: IEEE International Symposium on Information Theory (ISIT), pages 2098-2102 (2017)

  20. arXiv:1407.1255  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.SI physics.soc-ph

    Dynamic message-passing equations for models with unidirectional dynamics

    Authors: Andrey Y. Lokhov, Marc Mézard, Lenka Zdeborová

    Abstract: Understanding and quantifying the dynamics of disordered out-of-equilibrium models is an important problem in many branches of science. Using the dynamic cavity method on time trajectories, we construct a general procedure for deriving the dynamic message-passing equations for a large class of models with unidirectional dynamics, which includes the zero-temperature random field Ising model, the su… ▽ More

    Submitted 14 January, 2015; v1 submitted 4 July, 2014; originally announced July 2014.

    Comments: Final version

    Journal ref: Phys. Rev. E 91, 012811 (2015)

  21. arXiv:1402.1298  [pdf, other

    math.NA cond-mat.stat-mech cs.IT cs.LG stat.ML

    Phase transitions and sample complexity in Bayes-optimal matrix factorization

    Authors: Yoshiyuki Kabashima, Florent Krzakala, Marc Mézard, Ayaka Sakata, Lenka Zdeborová

    Abstract: We analyse the matrix factorization problem. Given a noisy measurement of a product of two matrices, the problem is to estimate back the original matrices. It arises in many applications such as dictionary learning, blind matrix calibration, sparse principal component analysis, blind source separation, low rank matrix completion, robust principal component analysis or factor analysis. It is also i… ▽ More

    Submitted 21 March, 2016; v1 submitted 6 February, 2014; originally announced February 2014.

    Comments: 50 pages, 10 figures

    Journal ref: IEEE Transactions on Information Theory (Volume:62 , Issue: 7, Pages: 4228 - 4265) 2016

  22. arXiv:1303.5315  [pdf, other

    physics.soc-ph cond-mat.stat-mech cs.SI q-bio.PE

    Inferring the origin of an epidemic with a dynamic message-passing algorithm

    Authors: Andrey Y. Lokhov, Marc Mézard, Hiroki Ohta, Lenka Zdeborová

    Abstract: We study the problem of estimating the origin of an epidemic outbreak -- given a contact network and a snapshot of epidemic spread at a certain time, determine the infection source. Finding the source is important in different contexts of computer or social networks. We assume that the epidemic spread follows the most commonly used susceptible-infected-recovered model. We introduce an inference al… ▽ More

    Submitted 2 July, 2014; v1 submitted 21 March, 2013; originally announced March 2013.

    Comments: 9 pages, 8 figures. Revised version, new figures added

    Journal ref: Phys. Rev. E 90, 012801 (2014)

  23. arXiv:1302.0189  [pdf, other

    cs.IT cond-mat.stat-mech q-bio.GN q-bio.QM

    Non-adaptive pooling strategies for detection of rare faulty items

    Authors: Pan Zhang, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: We study non-adaptive pooling strategies for detection of rare faulty items. Given a binary sparse N-dimensional signal x, how to construct a sparse binary MxN pooling matrix F such that the signal can be reconstructed from the smallest possible number M of measurements y=Fx? We show that a very low number of measurements is possible for random spatially coupled design of pools F. Our design might… ▽ More

    Submitted 1 February, 2013; originally announced February 2013.

    Comments: 5 pages

    Journal ref: IEEE International Conference on Communications Workshops (ICC 2013), Pages: 1409 - 1414, (2013)

  24. arXiv:1301.5898  [pdf, other

    cs.IT cond-mat.stat-mech cs.LG

    Phase Diagram and Approximate Message Passing for Blind Calibration and Dictionary Learning

    Authors: Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: We consider dictionary learning and blind calibration for signals and matrices created from a random ensemble. We study the mean-squared error in the limit of large signal dimension using the replica method and unveil the appearance of phase transitions delimiting impossible, possible-but-hard and possible inference regions. We also introduce an approximate message passing algorithm that asymptoti… ▽ More

    Submitted 24 January, 2013; originally announced January 2013.

    Comments: 5 pages

    Journal ref: Information Theory Proceedings (ISIT), 2013 IEEE International Symposium on, page(s) 659 - 663

  25. arXiv:1301.0901  [pdf, ps, other

    cs.IT cond-mat.stat-mech math.ST

    Compressed Sensing under Matrix Uncertainty: Optimum Thresholds and Robust Approximate Message Passing

    Authors: Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: In compressed sensing one measures sparse signals directly in a compressed form via a linear transform and then reconstructs the original signal. However, it is often the case that the linear transform itself is known only approximately, a situation called matrix uncertainty, and that the measurement process is noisy. Here we present two contributions to this problem: first, we use the replica met… ▽ More

    Submitted 5 January, 2013; originally announced January 2013.

    Comments: 5 pages, 4 figures

    Journal ref: Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 5519 - 5523

  26. arXiv:1211.2379  [pdf, ps, other

    math.NA cond-mat.stat-mech cs.IT

    Belief Propagation Reconstruction for Discrete Tomography

    Authors: Emmanuelle Gouillart, Florent Krzakala, Marc Mezard, Lenka Zdeborová

    Abstract: We consider the reconstruction of a two-dimensional discrete image from a set of tomographic measurements corresponding to the Radon projection. Assuming that the image has a structure where neighbouring pixels have a larger probability to take the same value, we follow a Bayesian approach and introduce a fast message-passing reconstruction algorithm based on belief propagation. For numerical resu… ▽ More

    Submitted 3 April, 2013; v1 submitted 11 November, 2012; originally announced November 2012.

    Journal ref: Inverse Problems 29, 3 (2013) 035003

  27. arXiv:1207.2079  [pdf, other

    cs.IT cond-mat.stat-mech math.ST

    Compressed Sensing of Approximately-Sparse Signals: Phase Transitions and Optimal Reconstruction

    Authors: Jean Barbier, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: Compressed sensing is designed to measure sparse signals directly in a compressed form. However, most signals of interest are only "approximately sparse", i.e. even though the signal contains only a small fraction of relevant (large) components the other components are not strictly equal to zero, but are only close to zero. In this paper we model the approximately sparse signal with a Gaussian dis… ▽ More

    Submitted 9 July, 2012; originally announced July 2012.

    Comments: 8 pages, 10 figures

    Journal ref: Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on , pp.800,807, 1-5 Oct. 2012

  28. arXiv:1206.3953  [pdf, other

    cond-mat.stat-mech cs.IT

    Probabilistic Reconstruction in Compressed Sensing: Algorithms, Phase Diagrams, and Threshold Achieving Matrices

    Authors: Florent Krzakala, Marc Mézard, François Sausset, Yifan Sun, Lenka Zdeborová

    Abstract: Compressed sensing is a signal processing method that acquires data directly in a compressed form. This allows one to make less measurements than what was considered necessary to record a signal, enabling faster or more precise measurement protocols in a wide range of applications. Using an interdisciplinary approach, we have recently proposed in [arXiv:1109.4424] a strategy that allows compressed… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: 42 pages, 37 figures, 3 appendixes

    Journal ref: J. Stat. Mech. (2012) P08009

  29. arXiv:1109.4424  [pdf, other

    cond-mat.stat-mech cs.IT

    Statistical physics-based reconstruction in compressed sensing

    Authors: Florent Krzakala, Marc Mézard, François Sausset, Yifan Sun, Lenka Zdeborová

    Abstract: Compressed sensing is triggering a major evolution in signal acquisition. It consists in sampling a sparse signal at low rate and later using computational power for its exact reconstruction, so that only the necessary information is measured. Currently used reconstruction techniques are, however, limited to acquisition rates larger than the true density of the signal. We design a new procedure wh… ▽ More

    Submitted 6 June, 2012; v1 submitted 20 September, 2011; originally announced September 2011.

    Comments: 20 pages, 8 figures, 3 tables. Related codes and data are available at http://aspics.krzakala.org

    Journal ref: Phys. Rev. X 2, 021005 (2012)

  30. arXiv:0908.1599  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.CC

    Decimation flows in constraint satisfaction problems

    Authors: Saburo Higuchi, Marc Mézard

    Abstract: We study hard constraint satisfaction problems with a decimation approach based on message passing algorithms. Decimation induces a renormalization flow in the space of problems, and we exploit the fact that this flow transforms some of the constraints into linear constraints over GF(2). In particular, when the flow hits the subspace of linear problems, one can stop decimation and use Gaussian e… ▽ More

    Submitted 11 August, 2009; originally announced August 2009.

    Comments: 14 pages, 2 figures

    Journal ref: J. Stat. Mech. (2009) P12009

  31. arXiv:0903.1621  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT

    Susceptibility Propagation for Constraint Satisfaction Problems

    Authors: Saburo Higuchi, Marc Mézard

    Abstract: We study the susceptibility propagation, a message-passing algorithm to compute correlation functions. It is applied to constraint satisfaction problems and its accuracy is examined. As a heuristic method to find a satisfying assignment, we propose susceptibility-guided decimation where correlations among the variables play an important role. We apply this novel decimation to locked occupation p… ▽ More

    Submitted 9 March, 2009; originally announced March 2009.

    Comments: 17 pages, 5 figures

    Journal ref: J. Phys.: Conf. Ser. 233(2010)012003

  32. arXiv:0810.1499  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.CC cs.DS

    Constraint satisfaction problems with isolated solutions are hard

    Authors: Lenka Zdeborová, Marc Mézard

    Abstract: We study the phase diagram and the algorithmic hardness of the random `locked' constraint satisfaction problems, and compare them to the commonly studied 'non-locked' problems like satisfiability of boolean formulas or graph coloring. The special property of the locked problems is that clusters of solutions are isolated points. This simplifies significantly the determination of the phase diagram… ▽ More

    Submitted 4 December, 2008; v1 submitted 8 October, 2008; originally announced October 2008.

    Comments: 19 pages, 12 figures

    Journal ref: J. Stat. Mech. (2008) P12004

  33. arXiv:0803.2955  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn cs.CC

    Locked constraint satisfaction problems

    Authors: Lenka Zdeborová, Marc Mézard

    Abstract: We introduce and study the random "locked" constraint satisfaction problems. When increasing the density of constraints, they display a broad "clustered" phase in which the space of solutions is divided into many isolated points. While the phase diagram can be found easily, these problems, in their clustered phase, are extremely hard from the algorithmic point of view: the best known algorithms… ▽ More

    Submitted 5 September, 2008; v1 submitted 20 March, 2008; originally announced March 2008.

    Comments: 4 pages, 2 figures

    Journal ref: Phys. Rev. Lett. 101, 078702 (2008)

  34. arXiv:0706.3104  [pdf, ps, other

    cs.DS cond-mat.dis-nn cond-mat.stat-mech cs.IT

    Group Testing with Random Pools: optimal two-stage algorithms

    Authors: Marc Mezard, Cristina Toninelli

    Abstract: We study Probabilistic Group Testing of a set of N items each of which is defective with probability p. We focus on the double limit of small defect probability, p<<1, and large number of variables, N>>1, taking either p->0 after $N\to\infty$ or $p=1/N^β$ with $β\in(0,1/2)$. In both settings the optimal number of tests which are required to identify with certainty the defectives via a two-stage… ▽ More

    Submitted 21 June, 2007; originally announced June 2007.

    Comments: 12 pages

  35. Geometrical organization of solutions to random linear Boolean equations

    Authors: Thierry Mora, Marc Mézard

    Abstract: The random XORSAT problem deals with large random linear systems of Boolean variables. The difficulty of such problems is controlled by the ratio of number of equations to number of variables. It is known that in some range of values of this parameter, the space of solutions breaks into many disconnected clusters. Here we study precisely the corresponding geometrical organization. In particular,… ▽ More

    Submitted 5 September, 2006; originally announced September 2006.

    Comments: 20 pages

    Journal ref: Journal of Statistical Mechanics: Theory and Experiment (2006) P10007

  36. arXiv:cond-mat/0603350  [pdf, ps, other

    cond-mat.dis-nn cs.CC math.CO

    The number of matchings in random graphs

    Authors: Lenka Zdeborová, Marc Mézard

    Abstract: We study matchings on sparse random graphs by means of the cavity method. We first show how the method reproduces several known results about maximum and perfect matchings in regular and Erdos-Renyi random graphs. Our main new result is the computation of the entropy, i.e. the leading order of the logarithm of the number of solutions, of matchings with a given size. We derive both an algorithm t… ▽ More

    Submitted 5 May, 2006; v1 submitted 13 March, 2006; originally announced March 2006.

    Comments: 17 pages, 6 figures, to be published in Journal of Statistical Mechanics

    Journal ref: J. Stat. Mech. (2006) P05003

  37. arXiv:cond-mat/0507451  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.CC

    Landscape of solutions in constraint satisfaction problems

    Authors: Marc Mezard, Matteo Palassini, Olivier Rivoire

    Abstract: We present a theoretical framework for characterizing the geometrical properties of the space of solutions in constraint satisfaction problems, together with practical algorithms for studying this structure on particular instances. We apply our method to the coloring problem, for which we obtain the total number of solutions and analyze in detail the distribution of distances between solutions.

    Submitted 2 November, 2005; v1 submitted 19 July, 2005; originally announced July 2005.

    Comments: 4 pages, 4 figures. Replaced with published version

    Journal ref: Phys. Rev. Lett. 95, 200202 (2005)

  38. arXiv:cond-mat/0506652  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT

    The theoretical capacity of the Parity Source Coder

    Authors: Stefano Ciliberti, Marc Mezard

    Abstract: The Parity Source Coder is a protocol for data compression which is based on a set of parity checks organized in a sparse random network. We consider here the case of memoryless unbiased binary sources. We show that the theoretical capacity saturate the Shannon limit at large K. We also find that the first corrections to the leading behavior are exponentially small, so that the behavior at finit… ▽ More

    Submitted 14 September, 2005; v1 submitted 24 June, 2005; originally announced June 2005.

    Comments: Added references, minor changes

    Journal ref: J. Stat Mech P10003 (2005)

  39. arXiv:cond-mat/0506053  [pdf, ps, other

    cond-mat.dis-nn cs.CC

    Pairs of SAT Assignment in Random Boolean Formulae

    Authors: Hervé Daudé, Marc Mezard, Thierry Mora, Riccardo Zecchina

    Abstract: We investigate geometrical properties of the random K-satisfiability problem using the notion of x-satisfiability: a formula is x-satisfiable if there exist two SAT assignments differing in Nx variables. We show the existence of a sharp threshold for this property as a function of the clause density. For large enough K, we prove that there exists a region of clause density, below the satisfiabil… ▽ More

    Submitted 19 September, 2007; v1 submitted 2 June, 2005; originally announced June 2005.

    Journal ref: Theoretical Computer Science 393 (2008) 260-279

  40. Clustering of solutions in the random satisfiability problem

    Authors: M. Mezard, T. Mora, R. Zecchina

    Abstract: Using elementary rigorous methods we prove the existence of a clustered phase in the random $K$-SAT problem, for $K\geq 8$. In this phase the solutions are grouped into clusters which are far away from each other. The results are in agreement with previous predictions of the cavity method and give a rigorous confirmation to one of its main building blocks. It can be generalized to other systems… ▽ More

    Submitted 4 April, 2005; originally announced April 2005.

    Comments: 4 pages, 1 figure

    Journal ref: Phys. Rev. Lett. 94, 197205 (2005)

  41. arXiv:cs/0309020  [pdf

    cs.CC cond-mat.dis-nn cs.DM

    Threshold values of Random K-SAT from the cavity method

    Authors: Stephan Mertens, Marc Mezard, Riccardo Zecchina

    Abstract: Using the cavity equations of \cite{mezard:parisi:zecchina:02,mezard:zecchina:02}, we derive the various threshold values for the number of clauses per variable of the random $K$-satisfiability problem, generalizing the previous results to $K \ge 4$. We also give an analytic solution of the equations, and some closed expressions for these thresholds, in an expansion around large $K$. The stabili… ▽ More

    Submitted 24 February, 2005; v1 submitted 12 September, 2003; originally announced September 2003.

    Comments: 38 pages; extended explanations and derivations; this version is going to appear in Random Structures & Algorithms

    ACM Class: F.2.0; G.2.0

  42. arXiv:cs/0212002  [pdf, ps, other

    cs.CC cond-mat.stat-mech

    Survey propagation: an algorithm for satisfiability

    Authors: A. Braunstein, M. Mezard, R. Zecchina

    Abstract: We study the satisfiability of randomly generated formulas formed by $M$ clauses of exactly $K$ literals over $N$ Boolean variables. For a given value of $N$ the problem is known to be most difficult with $α=M/N$ close to the experimental threshold $α_c$ separating the region where almost all formulas are SAT from the region where all formulas are UNSAT. Recent results from a statistical physics… ▽ More

    Submitted 4 April, 2006; v1 submitted 4 December, 2002; originally announced December 2002.

    Comments: 19 pages, 6 figure

    ACM Class: G.3

    Journal ref: Random Structures and Algorithms 27, 201-226 (2005)

  43. arXiv:cond-mat/0212451  [pdf, ps, other

    cond-mat.dis-nn cs.CC

    Constraint Satisfaction by Survey Propagation

    Authors: A. Braunstein, M. Mezard, M. Weigt, R. Zecchina

    Abstract: Survey Propagation is an algorithm designed for solving typical instances of random constraint satisfiability problems. It has been successfully tested on random 3-SAT and random $G(n,\frac{c}{n})$ graph 3-coloring, in the hard region of the parameter space. Here we provide a generic formalism which applies to a wide class of discrete Constraint Satisfaction Problems.

    Submitted 27 September, 2003; v1 submitted 18 December, 2002; originally announced December 2002.

    Comments: 8 pages, 5 figures

    Journal ref: Advances in Neural Information Processing Systems. Vol 9. Oxford University Press; 2005. 424

  44. arXiv:cond-mat/0207140  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.DM

    Alternative solutions to diluted p-spin models and XORSAT problems

    Authors: M. Mezard, F. Ricci-Tersenghi, R. Zecchina

    Abstract: We derive analytical solutions for p-spin models with finite connectivity at zero temperature. These models are the statistical mechanics equivalent of p-XORSAT problems in theoretical computer science. We give a full characterization of the phase diagram: location of the phase transitions (static and dynamic), together with a description of the clustering phenomenon taking place in configuratio… ▽ More

    Submitted 19 September, 2002; v1 submitted 4 July, 2002; originally announced July 2002.

    Comments: 14 pages, 14 figures. v3: small errors corrected, simpler notation used

    Journal ref: J. Stat. Phys. 111, 505 (2003)