-
The Physics of Local Optimization in Complex Disordered Systems
Authors:
Mutian Shen,
Gerardo Ortiz,
Zhiqiao Dong,
Martin Weigel,
Zohar Nussinov
Abstract:
Limited resources motivate decomposing large-scale problems into smaller, "local" subsystems and stitching together the so-found solutions. We explore the physics underlying this approach and discuss the concept of "local hardness", i.e., complexity from the local solver perspective, in determining the ground states of both P- and NP-hard spin-glasses and related systems. Depending on the model co…
▽ More
Limited resources motivate decomposing large-scale problems into smaller, "local" subsystems and stitching together the so-found solutions. We explore the physics underlying this approach and discuss the concept of "local hardness", i.e., complexity from the local solver perspective, in determining the ground states of both P- and NP-hard spin-glasses and related systems. Depending on the model considered, we observe varying scaling behaviors in how errors associated with local predictions decay as a function of the size of the solved subsystem. These errors stem from global critical threshold instabilities, characterized by gapless, avalanche-like excitations that follow scale-invariant size distributions. Away from criticality, local solvers quickly achieve high accuracy, aligning closely with the results of the more computationally intensive global minimization. These findings shed light on how Nature may operate solely through local actions at her disposal.
△ Less
Submitted 2 June, 2025; v1 submitted 5 May, 2025;
originally announced May 2025.
-
The Eggbox Ising Model
Authors:
Mutian Shen,
Yichen Xu,
Zohar Nussinov
Abstract:
We introduce a simple and versatile model that enables controlled design of rugged energy landscapes that realize different types of Parisi overlap distributions. This model captures quintessential aspects of Replica Symmetry Breaking (RSB) theory and may afford additional insights into complex systems and numerical methods for their analysis.
We introduce a simple and versatile model that enables controlled design of rugged energy landscapes that realize different types of Parisi overlap distributions. This model captures quintessential aspects of Replica Symmetry Breaking (RSB) theory and may afford additional insights into complex systems and numerical methods for their analysis.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
Reply to: Deep reinforced learning heuristic tested on spin-glass ground states: The larger picture
Authors:
Changjun Fan,
Mutian Shen,
Zohar Nussinov,
Zhong Liu,
Yizhou Sun,
Yang-Yu Liu
Abstract:
We wish to thank Stefan Boettcher for prompting us to further check and highlight the accuracy and scaling of our results. Here we provide a comprehensive response to the Comment written by him. We argue that the Comment did not account for the fairness of the comparison between different methods in searching for the spin-glass ground states. We demonstrate that, with a reasonably larger number of…
▽ More
We wish to thank Stefan Boettcher for prompting us to further check and highlight the accuracy and scaling of our results. Here we provide a comprehensive response to the Comment written by him. We argue that the Comment did not account for the fairness of the comparison between different methods in searching for the spin-glass ground states. We demonstrate that, with a reasonably larger number of initial spin configurations, our results agree with the asymptotic scaling form assumed by finite-size corrections.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Crystal Nucleation and Growth in Liquids: Cooperative Atom Attachment and Detachment
Authors:
Fangzheng Chen,
Zohar Nussinov,
K. F. Kelton
Abstract:
Classical theories of crystal nucleation and growth from the liquid assume activated processes that are interface limited, with the atoms individually joining the growing interface by jumps that occur at a rate that is determined by the diffusion coefficient in the liquid phase. These assumptions are in contradiction with the results of molecular dynamics studies that are presented here for superc…
▽ More
Classical theories of crystal nucleation and growth from the liquid assume activated processes that are interface limited, with the atoms individually joining the growing interface by jumps that occur at a rate that is determined by the diffusion coefficient in the liquid phase. These assumptions are in contradiction with the results of molecular dynamics studies that are presented here for supercooled Ni and Al20Ni60Zr20. Instead of diffusion-based attachment across the interface, atoms join the interface by making small changes so as to match the orientational order parameter of the nucleating crystal. Further, instead of joining individually multiple atoms join cooperatively, with the number of cooperative atoms increasing with decreasing temperature.
△ Less
Submitted 28 December, 2022;
originally announced December 2022.
-
Partons as unique ground states of quantum Hall parent Hamiltonians: The case of Fibonacci anyons
Authors:
M. Tanhayi Ahari,
S. Bandyopadhyay,
Z. Nussinov,
A. Seidel,
G. Ortiz
Abstract:
We present microscopic, multiple Landau level, (frustration-free and positive semi-definite) parent Hamiltonians whose ground states, realizing different quantum Hall fluids, are parton-like and whose excitations display either Abelian or non-Abelian braiding statistics. We prove ground state energy monotonicity theorems for systems with different particle numbers in multiple Landau levels, demons…
▽ More
We present microscopic, multiple Landau level, (frustration-free and positive semi-definite) parent Hamiltonians whose ground states, realizing different quantum Hall fluids, are parton-like and whose excitations display either Abelian or non-Abelian braiding statistics. We prove ground state energy monotonicity theorems for systems with different particle numbers in multiple Landau levels, demonstrate S-duality in the case of toroidal geometry, and establish complete sets of zero modes of special Hamiltonians stabilizing parton-like states. The emergent Entangled Pauli Principle (EPP), introduced in Phys. Rev. B 98, 161118(R) (2018) and which defines the ``DNA'' of the quantum Hall fluid, is behind the exact determination of the topological characteristics of the fluid, including charge and braiding statistics of excitations, and effective edge theory descriptions. When the closed-shell condition is satisfied, the densest (i.e., the highest density and lowest total angular momentum) zero-energy mode is a unique parton state. We conjecture that parton-like states generally span the subspace of many-body wave functions with the two-body $M$-clustering property within any given number of Landau levels. General arguments are supplemented by rigorous considerations for the $M=3$ case of fermions in four Landau levels. For this case, we establish that the zero mode counting can be done by enumerating certain patterns consistent with an underlying EPP. We apply the coherent state approach to show that the elementary (localized) bulk excitations are Fibonacci anyons. This demonstrates that the DNA associated with fractional quantum Hall states encodes all universal properties. Specifically, for parton-like states, we establish a link with tensor network structures of finite bond dimension that emerge via root level entanglement.
△ Less
Submitted 7 April, 2023; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Finding spin glass ground states through deep reinforcement learning
Authors:
Changjun Fan,
Mutian Shen,
Zohar Nussinov,
Zhong Liu,
Yizhou Sun,
Yang-Yu Liu
Abstract:
Spin glasses are disordered magnets with random interactions that are, generally, in conflict with each other. Finding the ground states of spin glasses is not only essential for the understanding of the nature of disordered magnetic and other physical systems, but also useful to solve a broad array of hard combinatorial optimization problems across multiple disciplines. Despite decades-long effor…
▽ More
Spin glasses are disordered magnets with random interactions that are, generally, in conflict with each other. Finding the ground states of spin glasses is not only essential for the understanding of the nature of disordered magnetic and other physical systems, but also useful to solve a broad array of hard combinatorial optimization problems across multiple disciplines. Despite decades-long efforts, an algorithm with both high accuracy and high efficiency is still lacking. Here we introduce DIRAC - a deep reinforcement learning framework, which can be trained purely on small-scale spin glass instances and then applied to arbitrarily large ones. DIRAC displays better scalability than other methods and can be leveraged to enhance any thermal annealing method. Extensive calculations on 2D, 3D and 4D Edwards-Anderson spin glass instances demonstrate the superior performance of DIRAC over existing methods. As many hard combinatorial optimization problems have Ising spin glass formulations, our results suggest a promising tool in solving these hard problems. Moreover, the presented algorithm will help us better understand the nature of the low-temperature spin-glass phase, which is a fundamental challenge in statistical physics.
△ Less
Submitted 29 September, 2021;
originally announced September 2021.
-
A new nature inspired modularity function adapted for unsupervised learning involving spatially embedded networks: A comparative analysis
Authors:
Raj Kishore,
Zohar Nussinov,
Kisor Kumar Sahu
Abstract:
Unsupervised machine learning methods can be of great help in many traditional engineering disciplines, where huge amount of labeled data is not readily available or is extremely difficult or costly to generate. Two specific examples include the structure of granular materials and atomic structure of metallic glasses. While the former is critically important for several hundreds of billion dollars…
▽ More
Unsupervised machine learning methods can be of great help in many traditional engineering disciplines, where huge amount of labeled data is not readily available or is extremely difficult or costly to generate. Two specific examples include the structure of granular materials and atomic structure of metallic glasses. While the former is critically important for several hundreds of billion dollars global industries, the latter is still a big puzzle in fundamental science. One thing is common in both the examples is that the particles are the elements of the ensembles that are embedded in Euclidean space and one can create a spatially embedded network to represent their key features. Some recent studies show that clustering, which generically refers to unsupervised learning, holds great promise in partitioning these networks. In many complex networks, the spatial information of nodes play very important role in determining the network properties. So understanding the structure of such networks is very crucial. We have compared the performance of our newly developed modularity function with some of the well-known modularity functions. We performed this comparison by finding the best partition in 2D and 3D granular assemblies. We show that for the class of networks considered in this article, our method produce much better results than the competing methods.
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
The Binomial Spin Glass
Authors:
Mohammad-Sadegh Vaezi,
Gerardo Ortiz,
Martin Weigel,
Zohar Nussinov
Abstract:
To establish a unified framework for studying both discrete and continuous coupling distributions, we introduce the {\it binomial} spin glass, a class of models where the couplings are sums of $m$ identically distributed Bernoulli random variables. In the continuum limit $m \to \infty$, the class reduces to one with Gaussian couplings, while $m=1$ corresponds to the $\pm J$ spin glass. We demonstr…
▽ More
To establish a unified framework for studying both discrete and continuous coupling distributions, we introduce the {\it binomial} spin glass, a class of models where the couplings are sums of $m$ identically distributed Bernoulli random variables. In the continuum limit $m \to \infty$, the class reduces to one with Gaussian couplings, while $m=1$ corresponds to the $\pm J$ spin glass. We demonstrate that for short-range Ising models on $d$-dimensional hypercubic lattices the ground-state entropy density for $N$ spins is bounded from above by $(\sqrt{d/2m} + 1/N)\ln2$, and further show that the actual entropies follow the scaling behavior implied by this bound. We thus uncover a fundamental non-commutativity of the thermodynamic and continuous coupling limits that leads to the presence or absence of degeneracies depending on the precise way the limits are taken. Exact calculations of defect energies reveal a crossover length scale $L^\ast(m) \sim L^κ$ below which the binomial spin glass is indistinguishable from the Gaussian system. Since $κ= -1/(2θ)$, where $θ$ is the spin-stiffness exponent, discrete couplings become irrelevant at large scales for systems with a finite-temperature spin-glass phase.
△ Less
Submitted 22 July, 2018; v1 submitted 22 December, 2017;
originally announced December 2017.
-
The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization
Authors:
Patrick Chao,
Tahereh Mazaheri,
Bo Sun,
Nicholas B. Weingartner,
Zohar Nussinov
Abstract:
We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as expansions in terms of various stochastic functions. The algorithm predicts the classification/regression values of new data by combining (via voting) the outputs of t…
▽ More
We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as expansions in terms of various stochastic functions. The algorithm predicts the classification/regression values of new data by combining (via voting) the outputs of these numerous linear expansions in randomly chosen functions. The few parameters (typically only one parameter is used in all studied examples) that this model has may be automatically optimized. The algorithm has been tested on 10 diverse training data sets of various types and feature space dimensions. It has been shown to consistently exhibit high accuracy and readily allow for optimization of parameters, while simultaneously avoiding pitfalls of existing algorithms such as those associated with class imbalance. We very briefly speculate on whether spatial coordinates in physical theories may be viewed as emergent "features" that enable a robust machine learning type description of data with generic low order smooth functions.
△ Less
Submitted 16 November, 2018; v1 submitted 18 August, 2017;
originally announced August 2017.
-
A phase space approach to supercooled liquids and a universal collapse of their viscosity
Authors:
Nicholas B. Weingartner,
Chris Pueblo,
Flavio S. Nogueira,
K. F. Kelton,
Zohar Nussinov
Abstract:
A broad fundamental understanding of the mechanisms underlying the phenomenology of supercooled liquids has remained elusive, despite decades of intense exploration. When supercooled beneath its characteristic melting temperature, a liquid sees a sharp rise in its viscosity over a narrow temperature range, eventually becoming frozen on laboratory timescales. Explaining this immense increase in vis…
▽ More
A broad fundamental understanding of the mechanisms underlying the phenomenology of supercooled liquids has remained elusive, despite decades of intense exploration. When supercooled beneath its characteristic melting temperature, a liquid sees a sharp rise in its viscosity over a narrow temperature range, eventually becoming frozen on laboratory timescales. Explaining this immense increase in viscosity is one of the principle goals of condensed matter physicists. To that end, numerous theoretical frameworks have been proposed which explain and reproduce the temperature dependence of the viscosity of supercooled liquids. Each of these frameworks appears only applicable to specific classes of glassformers and each possess a number of variable parameters. Here we describe a classical framework for explaining the dynamical behavior of supercooled liquids based on statistical mechanical considerations, and possessing only a single variable parameter. This parameter varies weakly from liquid to liquid. Furthermore, as predicted by this new classical theory and its earlier quantum counterpart, we find with the aid of a small dimensionless constant that varies in size from $\sim 0.05-0.12$, a universal (16 decade) collapse of the viscosity data as a function of temperature. The collapse appears in all known types of glass forming supercooled liquids (silicates, metallic alloys, organic systems, chalcogenide, sugars, and water).
△ Less
Submitted 9 November, 2016;
originally announced November 2016.
-
Probing Local Structure in Glass by the Application of Shear
Authors:
Nicholas B. Weingartner,
Zohar Nussinov
Abstract:
The glass transition remains one of the great unsolved mysteries of contemporary condensed matter physics. When crystallization is bypassed by rapid cooling, a supercooled liquid, retaining amorphous particle arrangment, results. The physical phenomenology of supercooled liquids is as vast as it is interesting. Most significant, the viscosity of the supercooled liquid displays an incredible increa…
▽ More
The glass transition remains one of the great unsolved mysteries of contemporary condensed matter physics. When crystallization is bypassed by rapid cooling, a supercooled liquid, retaining amorphous particle arrangment, results. The physical phenomenology of supercooled liquids is as vast as it is interesting. Most significant, the viscosity of the supercooled liquid displays an incredible increase over a narrow temperature range. Eventually, the supercooled liquid ceases to flow, becomes a glass, and gains rigidity and solid-like behaviors. Understanding what underpins the monumental growth of viscosity, and how rigidity results without long range order is a long-sought goal. Many theories of the glassy slowdown require the growth of static lengthscale related to structure with lowering of the temperature. To that end, we have proposed a new, natural lengthscale- "the shear penetration depth". This lengthscale quantifies the structural connectivity of the supercooled liquid. The shear penetration depth is defined as the distance up to which a shear perturbation applied to the boundary propagates into the liquid. We provide numerical data, based on the simulations of $NiZr_2$, illustrating that this length scale exhibits dramatic growth and eventual divergence upon approach to the glass transition. We further discuss this in relation to percolating structural connectivity and a new theory of the glass transition.
△ Less
Submitted 12 February, 2016;
originally announced February 2016.
-
Inference of hidden structures in complex physical systems by multi-scale clustering
Authors:
Z. Nussinov,
P. Ronhovde,
Dandan Hu,
S. Chakrabarty,
M. Sahu,
Bo Sun,
N. A. Mauro,
K. K. Sahu
Abstract:
We survey the application of a relatively new branch of statistical physics--"community detection"-- to data mining. In particular, we focus on the diagnosis of materials and automated image segmentation. Community detection describes the quest of partitioning a complex system involving many elements into optimally decoupled subsets or communities of such elements. We review a multiresolution vari…
▽ More
We survey the application of a relatively new branch of statistical physics--"community detection"-- to data mining. In particular, we focus on the diagnosis of materials and automated image segmentation. Community detection describes the quest of partitioning a complex system involving many elements into optimally decoupled subsets or communities of such elements. We review a multiresolution variant which is used to ascertain structures at different spatial and temporal scales. Significant patterns are obtained by examining the correlations between different independent solvers. Similar to other combinatorial optimization problems in the NP complexity class, community detection exhibits several phases. Typically, illuminating orders are revealed by choosing parameters that lead to extremal information theory correlations.
△ Less
Submitted 14 January, 2016; v1 submitted 5 March, 2015;
originally announced March 2015.
-
A locally preferred structure characterises all dynamical regimes of a supercooled liquid
Authors:
Ryan Soklaski,
Vy Tran,
Zohar Nussinov,
K. F. Kelton,
Li Yang
Abstract:
Recent experimental results suggest that metallic liquids universally exhibit a high-temperature dynamical crossover, which is correlated with the glass transition temperature ($T_{g}$). We demonstrate, using molecular dynamics results for Cu64Zr36, that this temperature, $T_{A} \approx 2 \times T_{g}$, is linked with cooperative atomic rearrangements that produce domains of connected icosahedra.…
▽ More
Recent experimental results suggest that metallic liquids universally exhibit a high-temperature dynamical crossover, which is correlated with the glass transition temperature ($T_{g}$). We demonstrate, using molecular dynamics results for Cu64Zr36, that this temperature, $T_{A} \approx 2 \times T_{g}$, is linked with cooperative atomic rearrangements that produce domains of connected icosahedra. Supercooling to a new characteristic temperature, $T_{D}$, is shown to produce higher order cooperative rearrangements amongst connected icosahedra, leading to large-scale domain fluctuations and the onset of glassy dynamics. These extensive domains then abruptly stabilize above $T_{g}$ and eventually percolate before the glass is formed. All characteristic temperatures ($T_{A}$, $T_{D}$ and $T_{g}$) are thus connected by successive manifestations of the structural cooperativity that begins at $T_{A}$.
△ Less
Submitted 23 March, 2016; v1 submitted 5 February, 2015;
originally announced February 2015.
-
An interacting replica approach applied to the traveling salesman problem
Authors:
Bo Sun,
Blake Leonard,
Peter Ronhovde,
Zohar Nussinov
Abstract:
We present a physics inspired heuristic method for solving combinatorial optimization problems. Our approach is specifically motivated by the desire to avoid trapping in metastable local minima- a common occurrence in hard problems with multiple extrema. Our method involves (i) coupling otherwise independent simulations of a system ("replicas") via geometrical distances as well as (ii) probabilist…
▽ More
We present a physics inspired heuristic method for solving combinatorial optimization problems. Our approach is specifically motivated by the desire to avoid trapping in metastable local minima- a common occurrence in hard problems with multiple extrema. Our method involves (i) coupling otherwise independent simulations of a system ("replicas") via geometrical distances as well as (ii) probabilistic inference applied to the solutions found by individual replicas. The {\it ensemble} of replicas evolves as to maximize the inter-replica correlation while simultaneously minimize the local intra-replica cost function (e.g., the total path length in the Traveling Salesman Problem within each replica). We demonstrate how our method improves the performance of rudimentary local optimization schemes long applied to the NP hard Traveling Salesman Problem. In particular, we apply our method to the well-known "$k$-opt" algorithm and examine two particular cases- $k=2$ and $k=3$. With the aid of geometrical coupling alone, we are able to determine for the optimum tour length on systems up to $280$ cities (an order of magnitude larger than the largest systems typically solved by the bare $k=3$ opt). The probabilistic replica-based inference approach improves $k-opt$ even further and determines the optimal solution of a problem with $318$ cities and find tours whose total length is close to that of the optimal solutions for other systems with a larger number of cities.
△ Less
Submitted 14 March, 2016; v1 submitted 27 June, 2014;
originally announced June 2014.
-
Improving the performance of algorithms to find communities in networks
Authors:
Richard K. Darst,
Zohar Nussinov,
Santo Fortunato
Abstract:
Many algorithms to detect communities in networks typically work without any information on the cluster structure to be found, as one has no a priori knowledge of it, in general. Not surprisingly, knowing some features of the unknown partition could help its identification, yielding an improvement of the performance of the method. Here we show that, if the number of clusters were known beforehand,…
▽ More
Many algorithms to detect communities in networks typically work without any information on the cluster structure to be found, as one has no a priori knowledge of it, in general. Not surprisingly, knowing some features of the unknown partition could help its identification, yielding an improvement of the performance of the method. Here we show that, if the number of clusters were known beforehand, standard methods, like modularity optimization, would considerably gain in accuracy, mitigating the severe resolution bias that undermines the reliability of the results of the original unconstrained version. The number of clusters can be inferred from the spectra of the recently introduced non-backtracking and flow matrices, even in benchmark graphs with realistic community structure. The limit of such two-step procedure is the overhead of the computation of the spectra.
△ Less
Submitted 1 December, 2014; v1 submitted 15 November, 2013;
originally announced November 2013.
-
Algorithm independent bounds on community detection problems and associated transitions in stochastic block model graphs
Authors:
Richard K. Darst,
David R. Reichman,
Peter Ronhovde,
Zohar Nussinov
Abstract:
We derive rigorous bounds for well-defined community structure in complex networks for a stochastic block model (SBM) benchmark. In particular, we analyze the effect of inter-community "noise" (inter-community edges) on any "community detection" algorithm's ability to correctly group nodes assigned to a planted partition, a problem which has been proven to be NP complete in a standard rendition. O…
▽ More
We derive rigorous bounds for well-defined community structure in complex networks for a stochastic block model (SBM) benchmark. In particular, we analyze the effect of inter-community "noise" (inter-community edges) on any "community detection" algorithm's ability to correctly group nodes assigned to a planted partition, a problem which has been proven to be NP complete in a standard rendition. Our result does not rely on the use of any one particular algorithm nor on the analysis of the limitations of inference. Rather, we turn the problem on its head and work backwards to examine when, in the first place, well defined structure may exist in SBMs.The method that we introduce here could potentially be applied to other computational problems. The objective of community detection algorithms is to partition a given network into optimally disjoint subgraphs (or communities). Similar to k-SAT and other combinatorial optimization problems, "community detection" exhibits different phases. Networks that lie in the "unsolvable phase" lack well-defined structure and thus have no partition that is meaningful. Solvable systems splinter into two disparate phases: those in the "hard" phase and those in the "easy" phase. As befits its name, within the easy phase, a partition is easy to achieve by known algorithms. When a network lies in the hard phase, it still has an underlying structure yet finding a meaningful partition which can be checked in polynomial time requires an exhaustive computational effort that rapidly increases with the size of the graph. When taken together, (i) the rigorous results that we report here on when graphs have an underlying structure and (ii) recent results concerning the limits of rather general algorithms, suggest bounds on the hard phase.
△ Less
Submitted 10 July, 2014; v1 submitted 24 June, 2013;
originally announced June 2013.
-
An edge density definition of overlapping and weighted graph communities
Authors:
Richard K. Darst David R. Reichman Peter Ronhovde,
Zohar Nussinov
Abstract:
Community detection in networks refers to the process of seeking strongly internally connected groups of nodes which are weakly externally connected. In this work, we introduce and study a community definition based on internal edge density. Beginning with the simple concept that edge density equals number of edges divided by maximal number of edges, we apply this definition to a variety of node a…
▽ More
Community detection in networks refers to the process of seeking strongly internally connected groups of nodes which are weakly externally connected. In this work, we introduce and study a community definition based on internal edge density. Beginning with the simple concept that edge density equals number of edges divided by maximal number of edges, we apply this definition to a variety of node and community arrangements to show that our definition yields sensible results. Our community definition is equivalent to that of the Absolute Potts Model community detection method (Phys. Rev. E 81, 046114 (2010)), and the performance of that method validates the usefulness of our definition across a wide variety of network types. We discuss how this definition can be extended to weighted, and multigraphs, and how the definition is capable of handling overlapping communities and local algorithms. We further validate our definition against the recently proposed Affiliation Graph Model (arXiv:1205.6228 [cs.SI]) and show that we can precisely solve these benchmarks. More than proposing an end-all community definition, we explain how studying the detailed properties of community definitions is important in order to validate that definitions do not have negative analytic properties. We urge that community definitions be separated from community detection algorithms and propose that community definitions be further evaluated by criteria such as these.
△ Less
Submitted 14 January, 2013;
originally announced January 2013.
-
Local multiresolution order in community detection
Authors:
Peter Ronhovde,
Zohar Nussinov
Abstract:
Community detection algorithms attempt to find the best clusters of nodes in an arbitrary complex network. Multi-scale ("multiresolution") community detection extends the problem to identify the best network scale(s) for these clusters. The latter task is generally accomplished by analyzing community stability simultaneously for all clusters in the network. In the current work, we extend this gene…
▽ More
Community detection algorithms attempt to find the best clusters of nodes in an arbitrary complex network. Multi-scale ("multiresolution") community detection extends the problem to identify the best network scale(s) for these clusters. The latter task is generally accomplished by analyzing community stability simultaneously for all clusters in the network. In the current work, we extend this general approach to define local multiresolution methods, which enable the extraction of well-defined local communities even if the global community structure is vaguely defined in an average sense. Toward this end, we propose measures analogous to variation of information and normalized mutual information that are used to quantitatively identify the best resolution(s) at the community level based on correlations between clusters in independently-solved systems. We demonstrate our method on two constructed networks as well as a real network and draw inferences about local community strength. Our approach is independent of the applied community detection algorithm save for the inherent requirement that the method be able to identify communities across different network scales, with appropriate changes to account for how different resolutions are evaluated or defined in a particular community detection method. It should, in principle, easily adapt to alternative community comparison measures.
△ Less
Submitted 18 November, 2014; v1 submitted 24 August, 2012;
originally announced August 2012.
-
Automatic Segmentation of Fluorescence Lifetime Microscopy Images of Cells Using Multi-Resolution Community Detection
Authors:
Dandan Hu,
Pinaki Sarder,
Peter Ronhovde,
Sandra Orthaus,
Samuel Achilefu,
Zohar Nussinov
Abstract:
We have developed an automatic method for segmenting fluorescence lifetime (FLT) imaging microscopy (FLIM) images of cells inspired by a multi-resolution community detection (MCD) based network segmentation method. The image processing problem is framed as identifying segments with respective average FLTs against a background in FLIM images. The proposed method segments a FLIM image for a given re…
▽ More
We have developed an automatic method for segmenting fluorescence lifetime (FLT) imaging microscopy (FLIM) images of cells inspired by a multi-resolution community detection (MCD) based network segmentation method. The image processing problem is framed as identifying segments with respective average FLTs against a background in FLIM images. The proposed method segments a FLIM image for a given resolution of the network composed using image pixels as the nodes and similarity between the pixels as the edges. In the resulting segmentation, low network resolution leads to larger segments and high network resolution leads to smaller segments. Further, the mean-square error (MSE) in estimating the FLT segments in a FLIM image using the proposed method was found to be consistently decreasing with increasing resolution of the corresponding network. The proposed MCD method outperformed a popular spectral clustering based method in performing FLIM image segmentation. The spectral segmentation method introduced noisy segments in its output at high resolution. It was unable to offer a consistent decrease in MSE with increasing resolution.
△ Less
Submitted 7 May, 2013; v1 submitted 22 August, 2012;
originally announced August 2012.
-
The stability to instability transition in the structure of large scale networks
Authors:
Dandan Hu,
Peter Ronhovde,
Zohar Nussinov
Abstract:
We examine phase transitions between the easy, hard, and the unsolvable phases when attempting to identify structure in large complex networks (community detection) in the presence of disorder induced by network noise (spurious links that obscure structure), heat bath temperature $T$, and system size $N$. When present, transitions at low temperature or low noise correspond to entropy driven (or "o…
▽ More
We examine phase transitions between the easy, hard, and the unsolvable phases when attempting to identify structure in large complex networks (community detection) in the presence of disorder induced by network noise (spurious links that obscure structure), heat bath temperature $T$, and system size $N$. When present, transitions at low temperature or low noise correspond to entropy driven (or "order by disorder") annealing effects wherein stability may initially increase as temperature or noise is increased before becoming unsolvable at sufficiently high temperature or noise. Additional transitions between contending viable solutions (such as those at different natural scales) are also possible. When analyzing community structure via a dynamical approach, "chaotic-type" transitions were previously identified [Phil. Mag. {\bf 92} 406 (2012)]. The correspondence between the spin-glass-type complexity transitions and transitions into chaos in dynamical analogs might extend to other hard computational problems. In this work, we examine large networks that have a large number of communities. We infer that large systems at a constant ratio of $q$ to the number of nodes $N$ asymptotically tend toward insolvability in the limit of large $N$ for any positive $T$. The asymptotic behavior of temperatures below which structure identification might be possible, $T_\times =O[1/\log q]$, decreases slowly, so for practical system sizes, there remains an accessible, and generally easy, global solvable phase at low temperature. We further employ multivariate Tutte polynomials to show that increasing $q$ emulates increasing $T$ for a general Potts model, leading to a similar stability region at low $T$. Given the relation between Tutte and Jones polynomials, our results further suggest a link between the above complexity transitions and transitions associated with random knots.
△ Less
Submitted 18 April, 2012;
originally announced April 2012.
-
Global disorder transition in the community structure of large-q Potts systems
Authors:
Peter Ronhovde,
Dandan Hu,
Zohar Nussinov
Abstract:
We examine a global disorder transition when identifying community structure in an arbitrary complex network. Earlier, we illustrated [Phil. Mag. 92, 406 (2012)] that "community detection" (CD) generally exhibits disordered (or unsolvable) and ordered (solvable) phases of both high and low computational complexity along with corresponding transitions from regular to chaotic dynamics in derived sys…
▽ More
We examine a global disorder transition when identifying community structure in an arbitrary complex network. Earlier, we illustrated [Phil. Mag. 92, 406 (2012)] that "community detection" (CD) generally exhibits disordered (or unsolvable) and ordered (solvable) phases of both high and low computational complexity along with corresponding transitions from regular to chaotic dynamics in derived systems. Using an exact generalized dimensional reduction inequality, multivariate Tutte polynomials, and other considerations, we illustrate how increasing the number of communities q emulates increasing the heat bath temperature T for a general weighted Potts model, leading to global disorder in the community structure of arbitrary large graphs. Dimensional reduction bounds lead to results similar to those suggested by mean-field type approaches. Large systems tend toward global insolvability in the limit of large q above a crossover temperature $T_\times\approx L|J_e|/[N\ln{} q]$ where |J_e| is a typical interaction strength, L is the number of edges, and N is the number of nodes. For practical system sizes, a solvable phase is generally accessible at low T. The global nature of the disorder transition does not preclude solutions by local CD algorithms (even those that employ global cost function parameters) as long as community evaluations are locally determined.
△ Less
Submitted 23 August, 2012; v1 submitted 16 April, 2012;
originally announced April 2012.
-
A Replica Inference Approach to Unsupervised Multi-Scale Image Segmentation
Authors:
Dandan Hu,
Peter Ronhovde,
Zohar Nussinov
Abstract:
We apply a replica inference based Potts model method to unsupervised image segmentation on multiple scales. This approach was inspired by the statistical mechanics problem of "community detection" and its phase diagram. Specifically, the problem is cast as identifying tightly bound clusters ("communities" or "solutes") against a background or "solvent". Within our multiresolution approach, we com…
▽ More
We apply a replica inference based Potts model method to unsupervised image segmentation on multiple scales. This approach was inspired by the statistical mechanics problem of "community detection" and its phase diagram. Specifically, the problem is cast as identifying tightly bound clusters ("communities" or "solutes") against a background or "solvent". Within our multiresolution approach, we compute information theory based correlations among multiple solutions ("replicas") of the same graph over a range of resolutions. Significant multiresolution structures are identified by replica correlations as manifest in information theory overlaps. With the aid of these correlations as well as thermodynamic measures, the phase diagram of the corresponding Potts model is analyzed both at zero and finite temperatures. Optimal parameters corresponding to a sensible unsupervised segmentation correspond to the "easy phase" of the Potts model. Our algorithm is fast and shown to be at least as accurate as the best algorithms to date and to be especially suited to the detection of camouflaged images.
△ Less
Submitted 28 June, 2011;
originally announced June 2011.
-
Detecting hidden spatial and spatio-temporal structures in glasses and complex physical systems by multiresolution network clustering
Authors:
P. Ronhovde,
S. Chakrabarty,
D. Hu,
M. Sahu,
K. F. Kelton,
N. A. Mauro,
K . K. Sahu,
Z. Nussinov
Abstract:
We elaborate on a general method that we recently introduced for characterizing the "natural" structures in complex physical systems via a multiscale network based approach for the data mining of such structures. The approach is based on "community detection" wherein interacting particles are partitioned into "an ideal gas" of optimally decoupled groups of particles. Specifically, we construct a s…
▽ More
We elaborate on a general method that we recently introduced for characterizing the "natural" structures in complex physical systems via a multiscale network based approach for the data mining of such structures. The approach is based on "community detection" wherein interacting particles are partitioned into "an ideal gas" of optimally decoupled groups of particles. Specifically, we construct a set of network representations ("replicas") of the physical system based on interatomic potentials and apply a multiscale clustering ("multiresolution community detection") analysis using information-based correlations among the replicas. Replicas may be (i) different representations of an identical static system or (ii) embody dynamics by when considering replicas to be time separated snapshots of the system (with a tunable time separation) or (iii) encode general correlations when different replicas correspond to different representations of the entire history of the system as it evolves in space-time. We apply our method to computer simulations of a binary Kob-Andersen Lennard-Jones system, a ternary model system, and to atomic coordinates in a ZrPt system as gleaned by reverse Monte Carlo analysis of experimentally determined structure factors. We identify the dominant structures (disjoint or overlapping) and general length scales by analyzing extrema of the information theory measures. We speculate on possible links between (i) physical transitions or crossovers and (ii) changes in structures found by this method as well as phase transitions associated with the computational complexity of the community detection problem. We briefly also consider continuum approaches and discuss the shear penetration depth in elastic media; this length scale increases as the system becomes increasingly rigid.
△ Less
Submitted 4 April, 2011; v1 submitted 8 February, 2011;
originally announced February 2011.
-
Detection of hidden structures on all scales in amorphous materials and complex physical systems: basic notions and applications to networks, lattice systems, and glasses
Authors:
P. Ronhovde,
S. Chakrabarty,
M. Sahu,
K. K. Sahu,
K. F. Kelton,
N. Mauro,
Z. Nussinov
Abstract:
Recent decades have seen the discovery of numerous complex materials. At the root of the complexity underlying many of these materials lies a large number of possible contending atomic- and larger-scale configurations and the intricate correlations between their constituents. For a detailed understanding, there is a need for tools that enable the detection of pertinent structures on all spatial an…
▽ More
Recent decades have seen the discovery of numerous complex materials. At the root of the complexity underlying many of these materials lies a large number of possible contending atomic- and larger-scale configurations and the intricate correlations between their constituents. For a detailed understanding, there is a need for tools that enable the detection of pertinent structures on all spatial and temporal scales. Towards this end, we suggest a new method by invoking ideas from network analysis and information theory. Our method efficiently identifies basic unit cells and topological defects in systems with low disorder and may analyze general amorphous structures to identify candidate natural structures where a clear definition of order is lacking. This general unbiased detection of physical structure does not require a guess as to which of the system properties should be deemed as important and may constitute a natural point of departure for further analysis. The method applies to both static and dynamic systems.
△ Less
Submitted 29 December, 2010;
originally announced January 2011.
-
High temperature correlation functions: universality, extraction of exchange interactions, divergent correlation lengths and generalized Debye length scales
Authors:
Saurish Chakrabarty,
Zohar Nussinov
Abstract:
We derive a universal form for the correlation function of general n component systems in the limit of high temperatures or weak coupling. This enables the extraction of effective microscopic interactions from measured high temperature correlation functions. We find that in systems with long range interactions, there exist diverging correlation lengths with amplitudes that tend to zero in the high…
▽ More
We derive a universal form for the correlation function of general n component systems in the limit of high temperatures or weak coupling. This enables the extraction of effective microscopic interactions from measured high temperature correlation functions. We find that in systems with long range interactions, there exist diverging correlation lengths with amplitudes that tend to zero in the high temperature limit. For general systems with disparate long range interactions, we introduce the notion of generalized Debye length (and time) scales and further relate it to the divergence of the largest correlation length in the high temperature (or weak coupling) limit.
△ Less
Submitted 6 May, 2011; v1 submitted 17 August, 2010;
originally announced August 2010.
-
Phase transitions in random Potts systems and the community detection problem: spin-glass type and dynamic perspectives
Authors:
Dandan Hu,
Peter Ronhovde,
Zohar Nussinov
Abstract:
Phase transitions in spin glass type systems and, more recently, in related computational problems have gained broad interest in disparate arenas. In the current work, we focus on the "community detection" problem when cast in terms of a general Potts spin glass type problem. As such, our results apply to rather broad Potts spin glass type systems. Community detection describes the general problem…
▽ More
Phase transitions in spin glass type systems and, more recently, in related computational problems have gained broad interest in disparate arenas. In the current work, we focus on the "community detection" problem when cast in terms of a general Potts spin glass type problem. As such, our results apply to rather broad Potts spin glass type systems. Community detection describes the general problem of partitioning a complex system involving many elements into optimally decoupled "communities" of such elements. We report on phase transitions between solvable and unsolvable regimes. Solvable region may further split into "easy" and "hard" phases. Spin glass type phase transitions appear at both low and high temperatures (or noise). Low temperature transitions correspond to an "order by disorder" type effect wherein fluctuations render the system ordered or solvable. Separate transitions appear at higher temperatures into a disordered (or an unsolvable) phase. Different sorts of randomness lead to disparate behaviors. We illustrate the spin glass character of both transitions and report on memory effects. We further relate Potts type spin systems to mechanical analogs and suggest how chaotic-type behavior in general thermodynamic systems can indeed naturally arise in hard-computational problems and spin-glasses. The correspondence between the two types of transitions (spin glass and dynamic) is likely to extend across a larger spectrum of spin glass type systems and hard computational problems. We briefly discuss potential implications of these transitions in complex many body physical systems.
△ Less
Submitted 19 July, 2011; v1 submitted 16 August, 2010;
originally announced August 2010.
-
Multiresolution community detection for megascale networks by information-based replica correlations
Authors:
Peter Ronhovde,
Zohar Nussinov
Abstract:
We use a Potts model community detection algorithm to accurately and quantitatively evaluate the hierarchical or multiresolution structure of a graph. Our multiresolution algorithm calculates correlations among multiple copies ("replicas") of the same graph over a range of resolutions. Significant multiresolution structures are identified by strongly correlated replicas. The average normalized m…
▽ More
We use a Potts model community detection algorithm to accurately and quantitatively evaluate the hierarchical or multiresolution structure of a graph. Our multiresolution algorithm calculates correlations among multiple copies ("replicas") of the same graph over a range of resolutions. Significant multiresolution structures are identified by strongly correlated replicas. The average normalized mutual information, the variation of information, and other measures in principle give a quantitative estimate of the "best" resolutions and indicate the relative strength of the structures in the graph. Because the method is based on information comparisons, it can in principle be used with any community detection model that can examine multiple resolutions. Our approach may be extended to other optimization problems. As a local measure, our Potts model avoids the "resolution limit" that affects other popular models. With this model, our community detection algorithm has an accuracy that ranks among the best of currently available methods. Using it, we can examine graphs over 40 million nodes and more than one billion edges. We further report that the multiresolution variant of our algorithm can solve systems of at least 200000 nodes and 10 million edges on a single processor with exceptionally high accuracy. For typical cases, we find a super-linear scaling, O(L^{1.3}) for community detection and O(L^{1.3} log N) for the multiresolution algorithm where L is the number of edges and N is the number of nodes in the system.
△ Less
Submitted 15 July, 2009; v1 submitted 5 December, 2008;
originally announced December 2008.
-
Local resolution-limit-free Potts model for community detection
Authors:
Peter Ronhovde,
Zohar Nussinov
Abstract:
We report on an exceptionally accurate spin-glass-type Potts model for community detection. With a simple algorithm, we find that our approach is at least as accurate as the best currently available algorithms and robust to the effects of noise. It is also competitive with the best currently available algorithms in terms of speed and size of solvable systems. We find that the computational dema…
▽ More
We report on an exceptionally accurate spin-glass-type Potts model for community detection. With a simple algorithm, we find that our approach is at least as accurate as the best currently available algorithms and robust to the effects of noise. It is also competitive with the best currently available algorithms in terms of speed and size of solvable systems. We find that the computational demand often exhibits superlinear scaling L^1.3 where L is the number of edges in the system, and we have applied the algorithm to synthetic systems as large as 40x10^6 nodes and over 1x10^9 edges. A previous stumbling block encountered by popular community detection methods is the so-called "resolution limit." Being a "local" measure of community structure, our Potts model is free from this resolution-limit effect, and it further remains a local measure on weighted and directed graphs. We also address the mitigation of resolution-limit effects for two other popular Potts models.
△ Less
Submitted 15 April, 2010; v1 submitted 18 March, 2008;
originally announced March 2008.
-
A Novel Approach Applied to the Largest Clique Problem
Authors:
Vladimir Gudkov,
Shmuel Nussinov,
Zohar Nussinov
Abstract:
A novel approach to complex problems has been previously applied to graph classification and the graph equivalence problem. Here we apply it to the NP complete problem of finding the largest perfect clique within a graph $G$.
A novel approach to complex problems has been previously applied to graph classification and the graph equivalence problem. Here we apply it to the NP complete problem of finding the largest perfect clique within a graph $G$.
△ Less
Submitted 17 September, 2002;
originally announced September 2002.