-
CLOVE: Travelling Salesman's approach to hyperbolic embeddings of complex networks with communities
Authors:
Sámuel G. Balogh,
Bendegúz Sulyok,
Tamás Vicsek,
Gergely Palla
Abstract:
The embedding of complex networks into metric spaces has become a research topic of high interest with a wide variety of proposed methods. Low dimensional hyperbolic spaces offer a natural co-domain for embeddings allowing a roughly uniform spatial distribution of the nodes even for scale-free networks and the efficient navigability and estimation of linking probabilities. According to recent resu…
▽ More
The embedding of complex networks into metric spaces has become a research topic of high interest with a wide variety of proposed methods. Low dimensional hyperbolic spaces offer a natural co-domain for embeddings allowing a roughly uniform spatial distribution of the nodes even for scale-free networks and the efficient navigability and estimation of linking probabilities. According to recent results, the communities of a complex network after optimization can be naturally mapped into well-defined angular sectors of the hyperbolic space. Here we introduce CLOVE, an embedding method exploiting this property based on iterative arrangement of the communities in a hierarchical manner, down to individual nodes. A crucial step in the process is finding the optimal angular order of the communities at a given level of the hierarchy, which is solved based on the Travelling Salesman Problem. Since CLOVE outperforms most of the alternative methods regarding different embedding quality measures and is computationally very efficient, it can be very useful in related down-stream machine learning tasks such as AI based pattern recognition.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
Intra-community link formation and modularity in ultracold growing hyperbolic networks
Authors:
Sámuel G. Balogh,
Gergely Palla
Abstract:
Hyperbolic network models, centered around the idea of placing nodes at random in a hyperbolic space and drawing links according to a probability that decreases as a function of the distance, provide a simple, yet also very capable framework for grasping the small-world, scale-free, highly clustered and modular nature of complex systems that are often referred to as real-world networks. In the pre…
▽ More
Hyperbolic network models, centered around the idea of placing nodes at random in a hyperbolic space and drawing links according to a probability that decreases as a function of the distance, provide a simple, yet also very capable framework for grasping the small-world, scale-free, highly clustered and modular nature of complex systems that are often referred to as real-world networks. In the present work we study the community structure of networks generated by the Popularity Similarity Optimization model (corresponding to one of the fundamental, widely known hyperbolic models) when the temperature parameter (responsible for tuning the clustering coefficient) is set to the limiting value of zero. By focusing on the intra-community link formation we derive analytical expressions for the expected modularity of a partitioning consisting of equally sized angular sectors in the native disk representation of the 2d hyperbolic space. Our formulas improve earlier results to a great extent, being able to estimate the average modularity (measured by numerical simulations) with high precision in a considerably larger range both in terms of the model parameters and also the relative size of the communities with respect to the entire network. These findings enhance our comprehension of how modules form in hyperbolic networks. The existence of these modules is somewhat unexpected, given the absence of explicit community formation steps in the model definition.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Maximally modular structure of growing hyperbolic networks
Authors:
Sámuel G. Balogh,
Bianka Kovács,
Gergely Palla
Abstract:
Hyperbolic models are remarkably good at reproducing the scale-free, highly clustered and small-world properties of networks representing real complex systems in a very simple framework. Here we show that for the popularity-similarity optimization model from this family, the generated networks become also extremely modular in the thermodynamic limit, in spite of lacking any explicit community form…
▽ More
Hyperbolic models are remarkably good at reproducing the scale-free, highly clustered and small-world properties of networks representing real complex systems in a very simple framework. Here we show that for the popularity-similarity optimization model from this family, the generated networks become also extremely modular in the thermodynamic limit, in spite of lacking any explicit community formation mechanism in the model definition. According to our analytical results supported by numerical simulations, when the system size is increased, the modularity approaches one surprisingly fast.
△ Less
Submitted 17 June, 2022;
originally announced June 2022.
-
Growing hyperbolic networks beyond two dimensions: the generalised popularity-similarity optimisation model
Authors:
Bianka Kovács,
Sámuel G. Balogh,
Gergely Palla
Abstract:
Hyperbolic network models have gained considerable attention in recent years, mainly due to their capability of explaining many peculiar features of real-world networks. One of the most widely known models of this type is the popularity-similarity optimisation (PSO) model, working in the native disk representation of the two-dimensional hyperbolic space and generating networks with small-world pro…
▽ More
Hyperbolic network models have gained considerable attention in recent years, mainly due to their capability of explaining many peculiar features of real-world networks. One of the most widely known models of this type is the popularity-similarity optimisation (PSO) model, working in the native disk representation of the two-dimensional hyperbolic space and generating networks with small-world property, scale-free degree distribution, high clustering and strong community structure at the same time. With the motivation of better understanding hyperbolic random graphs, we hereby introduce the $d$PSO model, a generalisation of the PSO model to any arbitrary integer dimension $d>2$. The analysis of the obtained networks shows that their major structural properties can be affected by the dimension of the underlying hyperbolic space in a non-trivial way. Our extended framework is not only interesting from a theoretical point of view but can also serve as a starting point for the generalisation of already existing two-dimensional hyperbolic embedding techniques.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Generalized entropies, density of states, and non-extensivity
Authors:
Sámuel G. Balogh,
Gergely Palla,
Péter Pollner,
Dániel Czégel
Abstract:
The concept of entropy connects the number of possible configurations with the number of variables in large stochastic systems. Independent or weakly interacting variables render the number of configurations scale exponentially with the number of variables, making the Boltzmann-Gibbs-Shannon entropy extensive. In systems with strongly interacting variables, or with variables driven by history-depe…
▽ More
The concept of entropy connects the number of possible configurations with the number of variables in large stochastic systems. Independent or weakly interacting variables render the number of configurations scale exponentially with the number of variables, making the Boltzmann-Gibbs-Shannon entropy extensive. In systems with strongly interacting variables, or with variables driven by history-dependent dynamics, this is no longer true. Here we show that contrary to the generally held belief, not only strong correlations or history-dependence, but skewed-enough distribution of visiting probabilities, that is, first-order statistics, also play a role in determining the relation between configuration space size and system size, or, equivalently, the extensive form of generalized entropy. We present a macroscopic formalism describing this interplay between first-order statistics, higher-order statistics, and configuration space growth. We demonstrate that knowing any two strongly restricts the possibilities of the third. We believe that this unified macroscopic picture of emergent degrees of freedom constraining mechanisms provides a step towards finding order in the zoo of strongly interacting complex systems.
△ Less
Submitted 23 June, 2020;
originally announced June 2020.
-
Time evolution of the hierarchical networks between PubMed MeSH terms
Authors:
Sámuel G. Balogh,
Dániel Zagyva,
Péter Pollner,
Gergely Palla
Abstract:
Hierarchical organisation is a prevalent feature of many complex networks appearing in nature and society. A relating interesting, yet less studied question is how does a hierarchical network evolve over time? Here we take a data driven approach and examine the time evolution of the network between the Medical Subject Headings (MeSH) provided by the National Center for Biotechnology Information (N…
▽ More
Hierarchical organisation is a prevalent feature of many complex networks appearing in nature and society. A relating interesting, yet less studied question is how does a hierarchical network evolve over time? Here we take a data driven approach and examine the time evolution of the network between the Medical Subject Headings (MeSH) provided by the National Center for Biotechnology Information (NCBI, part of the U. S. National Library of Medicine). The network between the MeSH terms is organised into 16 different, yearly updated hierarchies such as "Anatomy", "Diseases", "Chemicals and Drugs", etc. The natural representation of these hierarchies is given by directed acyclic graphs, composed of links pointing from nodes higher in the hierarchy towards nodes in lower levels. Due to the yearly updates, the structure of these networks is subject to constant evolution: new MeSH terms can appear, terms becoming obsolete can be deleted or be merged with other terms, and also already existing parts of the network may be rewired. We examine various statistical properties of the time evolution, with a special focus on the attachment and detachment mechanisms of the links, and find a few general features that are characteristic for all MeSH hierarchies. According to the results, the hierarchies investigated display an interesting interplay between non-uniform preference with respect to multiple different topological and hierarchical properties.
△ Less
Submitted 27 August, 2019;
originally announced August 2019.
-
Generalised thresholding of hidden variable network models with scale-free property
Authors:
Sámuel G. Balogh,
Péter Pollner,
Gergely Palla
Abstract:
The hidden variable formalism (based on the assumption of some intrinsic node parameters) turned out to be a remarkably efficient and powerful approach in describing and analyzing the topology of complex networks. Owing to one of its most advantageous property - namely proven to be able to reproduce a wide range of different degree distribution forms - it has become a standard tool for generating…
▽ More
The hidden variable formalism (based on the assumption of some intrinsic node parameters) turned out to be a remarkably efficient and powerful approach in describing and analyzing the topology of complex networks. Owing to one of its most advantageous property - namely proven to be able to reproduce a wide range of different degree distribution forms - it has become a standard tool for generating networks having the scale-free property. One of the most intensively studied version of this model is based on a thresholding mechanism of the exponentially distributed hidden variables associated to the nodes (intrinsic vertex weights), which give rise to the emergence of a scale-free network where the degree distribution $p(k)\sim k^{-γ}$ is decaying with an exponent of $γ=2$. Here we propose a generalization and modification of this model by extending the set of connection probabilities and hidden variable distributions that lead to the aforementioned degree distribution, and analyze the conditions leading to the above behavior analytically. In addition, we propose a relaxation of the hard threshold in the connection probabilities, which opens up the possibility for obtaining sparse scale free networks with arbitrary scaling exponent.
△ Less
Submitted 10 August, 2019;
originally announced August 2019.
-
Phase space volume scaling of generalized entropies and anomalous diffusion scaling governed by corresponding non-linear Fokker-Planck equations
Authors:
Dániel Czégel,
Sámuel G Balogh,
Péter Pollner,
Gergely Palla
Abstract:
Many physical, biological or social systems are governed by history-dependent dynamics or are composed of strongly interacting units, showing an extreme diversity of microscopic behaviour. Macroscopically, however, they can be efficiently modeled by generalizing concepts of the theory of Markovian, ergodic and weakly interacting stochastic processes. In this paper, we model stochastic processes by…
▽ More
Many physical, biological or social systems are governed by history-dependent dynamics or are composed of strongly interacting units, showing an extreme diversity of microscopic behaviour. Macroscopically, however, they can be efficiently modeled by generalizing concepts of the theory of Markovian, ergodic and weakly interacting stochastic processes. In this paper, we model stochastic processes by a family of generalized Fokker-Planck equations whose stationary solutions are equivalent to the maximum entropy distributions according to generalized entropies. We show that at asymptotically large times and volumes, the scaling exponent of the anomalous diffusion process described by the generalized Fokker-Planck equation and the phase space volume scaling exponent of the generalized entropy bijectively determine each other via a simple algebraic relation. This implies that these basic measures characterizing the transient and the stationary behaviour of the processes provide the same information regarding the asymptotic regime, and consequently, the classification of the processes given by these two exponents coincide.
△ Less
Submitted 7 February, 2018; v1 submitted 9 August, 2017;
originally announced August 2017.