-
Interacting Copies of Random Constraint Satisfaction Problems
Authors:
Maria Chiara Angelini,
Louise Budzynski,
Federico Ricci-Tersenghi
Abstract:
We study a system of $y=2$ coupled copies of a well-known constraint satisfaction problem (random hypergraph bicoloring) to examine how the ferromagnetic coupling between the copies affects the properties of the solution space. We solve the replicated model by applying the cavity method to the supervariables taking $2^y$ values. Our results show that a coupling of strength $γ$ between the copies d…
▽ More
We study a system of $y=2$ coupled copies of a well-known constraint satisfaction problem (random hypergraph bicoloring) to examine how the ferromagnetic coupling between the copies affects the properties of the solution space. We solve the replicated model by applying the cavity method to the supervariables taking $2^y$ values. Our results show that a coupling of strength $γ$ between the copies decreases the clustering threshold $α_d(γ)$, at which typical solutions shatters into disconnected components, therefore preventing numerical methods such as Monte Carlo Markov Chains from reaching equilibrium in polynomial time. This result needs to be reconciled with the observation that, in models with coupled copies, denser regions of the solution space should be more accessible. Additionally, we observe a change in the nature of the clustering phase transition, from discontinuous to continuous, in a wide $γ$ range. We investigate how the coupling affects the behavior of the Belief Propagation (BP) algorithm on finite-size instances and find that BP convergence is significantly impacted by the continuous transition. These results highlight the importance of better understanding algorithmic performance at the clustering transition, and call for a further exploration into the optimal use of re-weighting strategies designed to enhance algorithmic performances.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Evidence of Replica Symmetry Breaking under the Nishimori conditions in epidemic inference on graphs
Authors:
Alfredo Braunstein,
Louise Budzynski,
Matteo Mariani,
Federico Ricci-Tersenghi
Abstract:
In Bayesian inference, computing the posterior distribution from the data is typically a non-trivial problem, which usually requires approximations such as mean-field approaches or numerical methods, like the Monte Carlo Markov Chain. Being a high-dimensional distribution over a set of correlated variables, the posterior distribution can undergo the notorious replica symmetry breaking transition.…
▽ More
In Bayesian inference, computing the posterior distribution from the data is typically a non-trivial problem, which usually requires approximations such as mean-field approaches or numerical methods, like the Monte Carlo Markov Chain. Being a high-dimensional distribution over a set of correlated variables, the posterior distribution can undergo the notorious replica symmetry breaking transition. When it happens, several mean-field methods and virtually every Monte Carlo scheme can not provide a reasonable approximation to the posterior and its marginals. Replica symmetry is believed to be guaranteed whenever the data is generated with known prior and likelihood distributions, namely under the so-called Nishimori conditions. In this paper, we break this belief, by providing a counter-example showing that, under the Nishimori conditions, replica symmetry breaking arises. Introducing a simple, geometrical model that can be thought of as a patient zero retrieval problem in a highly infectious regime of the epidemic Susceptible-Infectious model, we show that under the Nishimori conditions, there is evidence of replica symmetry breaking. We achieve this result by computing the instability of the replica symmetric cavity method toward the one step replica symmetry broken phase. The origin of this phenomenon -- replica symmetry breaking under the Nishimori conditions -- is likely due to the correlated disorder appearing in the epidemic models.
△ Less
Submitted 28 February, 2025; v1 submitted 18 February, 2025;
originally announced February 2025.
-
Statistical Mechanics of Inference in Epidemic Spreading
Authors:
Alfredo Braunstein,
Louise Budzynski,
Matteo Mariani
Abstract:
We investigate the information-theoretical limits of inference tasks in epidemic spreading on graphs in the thermodynamic limit. The typical inference tasks consist in computing observables of the posterior distribution of the epidemic model given observations taken from a ground truth (sometimes called planted) random trajectory. We can identify two main sources of quenched disorder: the graph en…
▽ More
We investigate the information-theoretical limits of inference tasks in epidemic spreading on graphs in the thermodynamic limit. The typical inference tasks consist in computing observables of the posterior distribution of the epidemic model given observations taken from a ground truth (sometimes called planted) random trajectory. We can identify two main sources of quenched disorder: the graph ensemble and the planted trajectory. The epidemic dynamics however induces non-trivial long-range correlations among individuals' states on the latter. This results in non-local correlated quenched disorder which unfortunately is typically hard to handle. To overcome this difficulty, we divide the dynamical process into two sets of variables: a set of stochastic independent variables (representing transmission delays), plus a set of correlated variables (the infection times) that depend deterministically on the first. Treating the former as quenched variables and the latter as dynamic ones, computing disorder average becomes feasible by means of the Replica Symmetric cavity method. We give theoretical predictions on the posterior probability distribution of the trajectory of each individual, conditioned to observations on the state of individuals at given times, focusing on the Susceptible Infectious (SI) model. In the Bayes-optimal condition, i.e. when true dynamic parameters are known, the inference task is expected to fall in the Replica Symmetric regime. We indeed provide predictions for the information theoretic limits of various inference tasks, in form of phase diagrams. We also identify a region, in the Bayes-Optimal setting, with strong hints of Replica Symmetry Breaking. When true parameters are unknown, we show how a maximum-likelihood procedure is able to recover them with mostly unaffected performance.
△ Less
Submitted 24 July, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
Small Coupling Expansion for Multiple Sequence Alignment
Authors:
Louise Budzynski,
Andrea Pagnani
Abstract:
The alignment of biological sequences such as DNA, RNA, and proteins, is one of the basic tools that allow to detect evolutionary patterns, as well as functional/structural characterizations between homologous sequences in different organisms. Typically, state-of-the-art bioinformatics tools are based on profile models that assume the statistical independence of the different sites of the sequence…
▽ More
The alignment of biological sequences such as DNA, RNA, and proteins, is one of the basic tools that allow to detect evolutionary patterns, as well as functional/structural characterizations between homologous sequences in different organisms. Typically, state-of-the-art bioinformatics tools are based on profile models that assume the statistical independence of the different sites of the sequences. Over the last years, it has become increasingly clear that homologous sequences show complex patterns of long-range correlations over the primary sequence as a consequence of the natural evolution process that selects genetic variants under the constraint of preserving the functional/structural determinants of the sequence. Here, we present a new alignment algorithm based on message passing techniques that overcomes the limitations of profile models. Our method is based on a new perturbative small-coupling expansion of the free energy of the model that assumes a linear chain approximation as the $0^\mathrm{th}$-order of the expansion. We test the potentiality of the algorithm against standard competing strategies on several biological sequences.
△ Less
Submitted 27 April, 2023; v1 submitted 7 October, 2022;
originally announced October 2022.
-
The closest vector problem and the zero-temperature p-spin landscape for lossy compression
Authors:
Alfredo Braunstein,
Louise Budzynski,
Stefano Crotti,
Federico Ricci-Tersenghi
Abstract:
We consider a high-dimensional random constrained optimization problem in which a set of binary variables is subjected to a linear system of equations. The cost function is a simple linear cost, measuring the Hamming distance with respect to a reference configuration. Despite its apparent simplicity, this problem exhibits a rich phenomenology. We show that different situations arise depending on t…
▽ More
We consider a high-dimensional random constrained optimization problem in which a set of binary variables is subjected to a linear system of equations. The cost function is a simple linear cost, measuring the Hamming distance with respect to a reference configuration. Despite its apparent simplicity, this problem exhibits a rich phenomenology. We show that different situations arise depending on the random ensemble of linear systems. When each variable is involved in at most two linear constraints, we show that the problem can be partially solved analytically, in particular we show that upon convergence, the zero-temperature limit of the cavity equations returns the optimal solution. We then study the geometrical properties of more general random ensembles. In particular we observe a range in the density of constraints at which the systems enters a glassy phase where the cost function has many minima. Interestingly, the algorithmic performances are only sensitive to another phase transition affecting the structure of configurations allowed by the linear constraints. We also extend our results to variables belonging to $\text{GF}(q)$, the Galois Field of order $q$. We show that increasing the value of $q$ allows to achieve a better optimum, which is confirmed by the Replica Symmetric cavity method predictions.
△ Less
Submitted 24 October, 2022; v1 submitted 1 July, 2022;
originally announced July 2022.
-
Biased measures for random Constraint Satisfaction Problems: larger interaction range and asymptotic expansion
Authors:
Louise Budzynski,
Guilhem Semerjian
Abstract:
We investigate the clustering transition undergone by an exemplary random constraint satisfaction problem, the bicoloring of $k$-uniform random hypergraphs, when its solutions are weighted non-uniformly, with a soft interaction between variables belonging to distinct hyperedges. We show that the threshold $α_{\rm d}(k)$ for the transition can be further increased with respect to a restricted inter…
▽ More
We investigate the clustering transition undergone by an exemplary random constraint satisfaction problem, the bicoloring of $k$-uniform random hypergraphs, when its solutions are weighted non-uniformly, with a soft interaction between variables belonging to distinct hyperedges. We show that the threshold $α_{\rm d}(k)$ for the transition can be further increased with respect to a restricted interaction within the hyperedges, and perform an asymptotic expansion of $α_{\rm d}(k)$ in the large $k$ limit. We find that $α_{\rm d}(k) = \frac{2^{k-1}}{k}(\ln k + \ln \ln k + γ_{\rm d} + o(1))$, where the constant $γ_{\rm d}$ is strictly larger than for the uniform measure over solutions.
△ Less
Submitted 4 September, 2020; v1 submitted 20 July, 2020;
originally announced July 2020.
-
The asymptotics of the clustering transition for random constraint satisfaction problems
Authors:
Louise Budzynski,
Guilhem Semerjian
Abstract:
Random Constraint Satisfaction Problems exhibit several phase transitions when their density of constraints is varied. One of these threshold phenomena, known as the clustering or dynamic transition, corresponds to a transition for an information theoretic problem called tree reconstruction. In this article we study this threshold for two CSPs, namely the bicoloring of $k$-uniform hypergraphs with…
▽ More
Random Constraint Satisfaction Problems exhibit several phase transitions when their density of constraints is varied. One of these threshold phenomena, known as the clustering or dynamic transition, corresponds to a transition for an information theoretic problem called tree reconstruction. In this article we study this threshold for two CSPs, namely the bicoloring of $k$-uniform hypergraphs with a density $α$ of constraints, and the $q$-coloring of random graphs with average degree $c$. We show that in the large $k,q$ limit the clustering transition occurs for $α= \frac{2^{k-1}}{k} (\ln k + \ln \ln k + γ_{\rm d} + o(1))$, $c= q (\ln q + \ln \ln q + γ_{\rm d}+ o(1))$, where $γ_{\rm d}$ is the same constant for both models. We characterize $γ_{\rm d}$ via a functional equation, solve the latter numerically to estimate $γ_{\rm d} \approx 0.871$, and obtain an analytic lowerbound $γ_{\rm d} \ge 1 + \ln (2 (\sqrt{2}-1)) \approx 0.812$. Our analysis unveils a subtle interplay of the clustering transition with the rigidity (naive reconstruction) threshold that occurs on the same asymptotic scale at $γ_{\rm r}=1$.
△ Less
Submitted 3 June, 2020; v1 submitted 21 November, 2019;
originally announced November 2019.
-
Biased landscapes for random Constraint Satisfaction Problems
Authors:
Louise Budzynski,
Federico Ricci-Tersenghi,
Guilhem Semerjian
Abstract:
The typical complexity of Constraint Satisfaction Problems (CSPs) can be investigated by means of random ensembles of instances. The latter exhibit many threshold phenomena besides their satisfiability phase transition, in particular a clustering or dynamic phase transition (related to the tree reconstruction problem) at which their typical solutions shatter into disconnected components. In this p…
▽ More
The typical complexity of Constraint Satisfaction Problems (CSPs) can be investigated by means of random ensembles of instances. The latter exhibit many threshold phenomena besides their satisfiability phase transition, in particular a clustering or dynamic phase transition (related to the tree reconstruction problem) at which their typical solutions shatter into disconnected components. In this paper we study the evolution of this phenomenon under a bias that breaks the uniformity among solutions of one CSP instance, concentrating on the bicoloring of k-uniform random hypergraphs. We show that for small k the clustering transition can be delayed in this way to higher density of constraints, and that this strategy has a positive impact on the performances of Simulated Annealing algorithms. We characterize the modest gain that can be expected in the large k limit from the simple implementation of the biasing idea studied here. This paper contains also a contribution of a more methodological nature, made of a review and extension of the methods to determine numerically the discontinuous dynamic transition threshold.
△ Less
Submitted 8 March, 2019; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Inhomogeneous Gaussian Free Field inside the interacting arctic curve
Authors:
Etienne Granet,
Louise Budzynski,
Jérôme Dubail,
Jesper Lykke Jacobsen
Abstract:
The six-vertex model with domain-wall boundary conditions is one representative of a class of two-dimensional lattice statistical mechanics models that exhibit a phase separation known as the arctic curve phenomenon. In the thermodynamic limit, the degrees of freedom are completely frozen in a region near the boundary, while they are critically fluctuating in a central region. The arctic curve is…
▽ More
The six-vertex model with domain-wall boundary conditions is one representative of a class of two-dimensional lattice statistical mechanics models that exhibit a phase separation known as the arctic curve phenomenon. In the thermodynamic limit, the degrees of freedom are completely frozen in a region near the boundary, while they are critically fluctuating in a central region. The arctic curve is the phase boundary that separates those two regions. Critical fluctuations inside the arctic curve have been studied extensively, both in physics and in mathematics, in free models (i.e., models that map to free fermions, or equivalently to determinantal point processes). Here we study those critical fluctuations in the interacting (i.e., not free, not determinantal) six-vertex model, and provide evidence for the following two claims:
(i) the critical fluctuations are given by a Gaussian Free Field (GFF), as in the free case, but
(ii) contrarily to the free case, the GFF is inhomogeneous, meaning that its coupling constant $K$ becomes position-dependent, $K \rightarrow K({\rm x})$.
The evidence is mainly based on the numerical solution of appropriate Bethe ansatz equations with an imaginary extensive twist, and on transfer matrix computations, but the second claim is also supported by the analytic calculation of $K$ and its first two derivatives in selected points. Contrarily to the usual GFF, this inhomogeneous GFF is not defined in terms of the Green's function of the Laplacian $Δ= \nabla \cdot \nabla$ inside the critical domain, but instead, of the Green's function of a generalized Laplacian $Δ= \nabla \cdot \frac{1}{K} \nabla$ parametrized by the function $K$. Surprisingly, we also find that there is a change of regime when $Δ\leq -1/2$, with $K$ becoming singular at one point.
△ Less
Submitted 20 July, 2018;
originally announced July 2018.