-
Comment on "Dynamic Opinion Model and Invasion Percolation"
Authors:
A. Sattari,
M. Paczuski,
P. Grassberger
Abstract:
In J. Shao et al., PRL 103, 108701 (2009) the authors claim that a model with majority rule coarsening exhibits in d=2 a percolation transition in the universality class of invasion percolation with trapping. In the present comment we give compelling evidence, including high statistics simulations on much larger lattices, that this is not correct. and that the model is trivially in the ordinary pe…
▽ More
In J. Shao et al., PRL 103, 108701 (2009) the authors claim that a model with majority rule coarsening exhibits in d=2 a percolation transition in the universality class of invasion percolation with trapping. In the present comment we give compelling evidence, including high statistics simulations on much larger lattices, that this is not correct. and that the model is trivially in the ordinary percolation universality class.
△ Less
Submitted 8 August, 2012;
originally announced August 2012.
-
Agglomerative Percolation on Bipartite Networks: A Novel Type of Spontaneous Symmetry Breaking
Authors:
Hon Wai Lau,
Maya Paczuski,
Peter Grassberger
Abstract:
Ordinary bond percolation (OP) can be viewed as a process where clusters grow by joining them pairwise, by adding links chosen randomly one by one from a set of predefined `virtual' links. In contrast, in agglomerative percolation (AP) clusters grow by choosing randomly a `target cluster' and joining it with all its neighbors, as defined by the same set of virtual links. Previous studies showed th…
▽ More
Ordinary bond percolation (OP) can be viewed as a process where clusters grow by joining them pairwise, by adding links chosen randomly one by one from a set of predefined `virtual' links. In contrast, in agglomerative percolation (AP) clusters grow by choosing randomly a `target cluster' and joining it with all its neighbors, as defined by the same set of virtual links. Previous studies showed that AP is in different universality classes from OP for several types of (virtual) networks (linear chains, trees, Erdos-Renyi networks), but most surprising were the results for 2-d lattices: While AP on the triangular lattice was found to be in the OP universality class, it behaved completely differently on the square lattice. In the present paper we explain this striking violation of universality by invoking bipartivity. While the square lattice is a bipartite graph, the triangular lattice is not. In conformity with this we show that AP on the honeycomb and simple cubic (3-d) lattices -- both of which are bipartite -- are also not in the OP universality classes. More precisely, we claim that this violation of universality is basically due to a Z_2 symmetry that is spontaneously broken at the percolation threshold. We also discuss AP on bipartite random networks and suitable generalizations of AP on k-partite graphs.
△ Less
Submitted 9 April, 2012; v1 submitted 5 April, 2012;
originally announced April 2012.
-
Discontinuous Percolation Transitions in Epidemic Processes, Surface Depinning in Random Media and Hamiltonian Random Graphs
Authors:
Golnoosh Bizhani,
Maya Paczuski,
Peter Grassberger
Abstract:
Discontinuous percolation transitions and the associated tricritical points are manifest in a wide range of both equilibrium and non-equilibrium cooperative phenomena. To demonstrate this, we present and relate the continuous and first order behaviors in two different classes of models: The first are generalized epidemic processes (GEP) that describe in their spatially embedded version - either on…
▽ More
Discontinuous percolation transitions and the associated tricritical points are manifest in a wide range of both equilibrium and non-equilibrium cooperative phenomena. To demonstrate this, we present and relate the continuous and first order behaviors in two different classes of models: The first are generalized epidemic processes (GEP) that describe in their spatially embedded version - either on or off a regular lattice - compact or fractal cluster growth in random media at zero temperature. A random graph version of GEP is mapped onto a model previously proposed for complex social contagion. We compute detailed phase diagrams and compare our numerical results at the tricritical point in d = 3 with field theory predictions of Janssen et al. [Phys. Rev. E 70, 026114 (2004)]. The second class consists of exponential ("Hamiltonian", or formally equilibrium) random graph models and includes the Strauss and the 2-star model, where 'chemical potentials' control the densities of links, triangles or 2-stars. When the chemical potentials in either graph model are O(logN), the percolation transition can coincide with a first order phase transition in the density of links, making the former also discontinuous. Hysteresis loops can then be of mixed order, with second order behavior for decreasing link fugacity, and a jump (first order) when it increases.
△ Less
Submitted 5 April, 2012; v1 submitted 14 February, 2012;
originally announced February 2012.
-
PageRank and rank-reversal dependence on the damping factor
Authors:
Seung-Woo Son,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
PageRank (PR) is an algorithm originally developed by Google to evaluate the importance of web pages. Considering how deeply rooted Google's PR algorithm is to gathering relevant information or to the success of modern businesses, the question of rank-stability and choice of the damping factor (a parameter in the algorithm) is clearly important. We investigate PR as a function of the damping facto…
▽ More
PageRank (PR) is an algorithm originally developed by Google to evaluate the importance of web pages. Considering how deeply rooted Google's PR algorithm is to gathering relevant information or to the success of modern businesses, the question of rank-stability and choice of the damping factor (a parameter in the algorithm) is clearly important. We investigate PR as a function of the damping factor d on a network obtained from a domain of the World Wide Web, finding that rank-reversal happens frequently over a broad range of PR (and of d). We use three different correlation measures, Pearson, Spearman, and Kendall, to study rank-reversal as d changes, and show that the correlation of PR vectors drops rapidly as d changes from its frequently cited value, $d_0=0.85$. Rank-reversal is also observed by measuring the Spearman and Kendall rank correlation, which evaluate relative ranks rather than absolute PR. Rank-reversal happens not only in directed networks containing rank-sinks but also in a single strongly connected component, which by definition does not contain any sinks. We relate rank-reversals to rank-pockets and bottlenecks in the directed network structure. For the network studied, the relative rank is more stable by our measures around $d=0.65$ than at $d=d_0$.
△ Less
Submitted 23 January, 2012;
originally announced January 2012.
-
Sampling properties of directed networks
Authors:
Seung-Woo Son,
Claire Christensen,
Golnoosh Bizhani,
David V. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
For many real-world networks only a small "sampled" version of the original network may be investigated; those results are then used to draw conclusions about the actual system. Variants of breadth-first search (BFS) sampling, which are based on epidemic processes, are widely used. Although it is well established that BFS sampling fails, in most cases, to capture the IN-component(s) of directed ne…
▽ More
For many real-world networks only a small "sampled" version of the original network may be investigated; those results are then used to draw conclusions about the actual system. Variants of breadth-first search (BFS) sampling, which are based on epidemic processes, are widely used. Although it is well established that BFS sampling fails, in most cases, to capture the IN-component(s) of directed networks, a description of the effects of BFS sampling on other topological properties are all but absent from the literature. To systematically study the effects of sampling biases on directed networks, we compare BFS sampling to random sampling on complete large-scale directed networks. We present new results and a thorough analysis of the topological properties of seven different complete directed networks (prior to sampling), including three versions of Wikipedia, three different sources of sampled World Wide Web data, and an Internet-based social network. We detail the differences that sampling method and coverage can make to the structural properties of sampled versions of these seven networks. Most notably, we find that sampling method and coverage affect both the bow-tie structure, as well as the number and structure of strongly connected components in sampled networks. In addition, at low sampling coverage (i.e. less than 40%), the values of average degree, variance of out-degree, degree auto-correlation, and link reciprocity are overestimated by 30% or more in BFS-sampled networks, and only attain values within 10% of the corresponding values in the complete networks when sampling coverage is in excess of 65%. These results may cause us to rethink what we know about the structure, function, and evolution of real-world directed networks.
△ Less
Submitted 13 October, 2012; v1 submitted 6 January, 2012;
originally announced January 2012.
-
Random Sequential Renormalization and Agglomerative Percolation in Networks: Application to Erd"os-R'enyi and Scale-free Graphs
Authors:
Golnoosh Bizhani,
Peter Grassberger,
Maya Paczuski
Abstract:
We study the statistical behavior under random sequential renormalization(RSR) of several network models including Erd"os R'enyi (ER) graphs, scale-free networks and an annealed model (AM) related to ER graphs. In RSR the network is locally coarse grained by choosing at each renormalization step a node at random and joining it to all its neighbors. Compared to previous (quasi-)parallel renormaliza…
▽ More
We study the statistical behavior under random sequential renormalization(RSR) of several network models including Erd"os R'enyi (ER) graphs, scale-free networks and an annealed model (AM) related to ER graphs. In RSR the network is locally coarse grained by choosing at each renormalization step a node at random and joining it to all its neighbors. Compared to previous (quasi-)parallel renormalization methods [C.Song et.al], RSR allows a more fine-grained analysis of the renormalization group (RG) flow, and unravels new features, that were not discussed in the previous analyses. In particular we find that all networks exhibit a second order transition in their RG flow. This phase transition is associated with the emergence of a giant hub and can be viewed as a new variant of percolation, called agglomerative percolation. We claim that this transition exists also in previous graph renormalization schemes and explains some of the scaling laws seen there. For critical trees it happens as N/N0 -> 0 in the limit of large systems (where N0 is the initial size of the graph and N its size at a given RSR step). In contrast, it happens at finite N/N0 in sparse ER graphs and in the annealed model, while it happens for N/N0 -> 1 on scale-free networks. Critical exponents seem to depend on the type of the graph but not on the average degree and obey usual scaling relations for percolation phenomena. For the annealed model they agree with the exponents obtained from a mean-field theory. At late times, the networks exhibit a star-like structure in agreement with the results of Radicchi et. al. While degree distributions are of main interest when regarding the scheme as network renormalization, mass distributions (which are more relevant when considering 'supernodes' as clusters) are much easier to study using the fast Newman-Ziff algorithm for percolation, allowing us to obtain very high statistics.
△ Less
Submitted 12 December, 2011; v1 submitted 21 September, 2011;
originally announced September 2011.
-
Percolation Theory on Interdependent Networks Based on Epidemic Spreading
Authors:
Seung-Woo Son,
Golnoosh Bizhani,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
We consider percolation on interdependent locally treelike networks, recently introduced by Buldyrev et al., Nature 464, 1025 (2010), and demonstrate that the problem can be simplified conceptually by deleting all references to cascades of failures. Such cascades do exist, but their explicit treatment just complicates the theory -- which is a straightforward extension of the usual epidemic spreadi…
▽ More
We consider percolation on interdependent locally treelike networks, recently introduced by Buldyrev et al., Nature 464, 1025 (2010), and demonstrate that the problem can be simplified conceptually by deleting all references to cascades of failures. Such cascades do exist, but their explicit treatment just complicates the theory -- which is a straightforward extension of the usual epidemic spreading theory on a single network. Our method has the added benefits that it is directly formulated in terms of an order parameter and its modular structure can be easily extended to other problems, e.g. to any number of interdependent networks, or to networks with dependency links.
△ Less
Submitted 20 September, 2011;
originally announced September 2011.
-
Exact solutions for mass-dependent irreversible aggregations
Authors:
Seung-Woo Son,
Claire Christensen,
Golnoosh Bizhani,
Peter Grassberger,
Maya Paczuski
Abstract:
We consider the mass-dependent aggregation process (k+1)X -> X, given a fixed number of unit mass particles in the initial state. One cluster is chosen proportional to its mass and is merged into one either with k-neighbors in one dimension, or -- in the well-mixed case -- with k other clusters picked randomly. We find the same combinatorial exact solutions for the probability to find any given co…
▽ More
We consider the mass-dependent aggregation process (k+1)X -> X, given a fixed number of unit mass particles in the initial state. One cluster is chosen proportional to its mass and is merged into one either with k-neighbors in one dimension, or -- in the well-mixed case -- with k other clusters picked randomly. We find the same combinatorial exact solutions for the probability to find any given configuration of particles on a ring or line, and in the well-mixed case. The mass distribution of a single cluster exhibits scaling laws and the finite size scaling form is given. The relation to the classical sum kernel of irreversible aggregation is discussed.
△ Less
Submitted 31 August, 2011;
originally announced September 2011.
-
Are Percolation Transitions always Sharpened by Making Networks Interdependent?
Authors:
Seung-Woo Son,
Peter Grassberger,
Maya Paczuski
Abstract:
We study a model for coupled networks introduced recently by Buldyrev et al., Nature 464, 1025 (2010), where each node has to be connected to others via two types of links to be viable. Removing a critical fraction of nodes leads to a percolation transition that has been claimed to be more abrupt than that for uncoupled networks. Indeed, it was found to be discontinuous in all cases studied. Using…
▽ More
We study a model for coupled networks introduced recently by Buldyrev et al., Nature 464, 1025 (2010), where each node has to be connected to others via two types of links to be viable. Removing a critical fraction of nodes leads to a percolation transition that has been claimed to be more abrupt than that for uncoupled networks. Indeed, it was found to be discontinuous in all cases studied. Using an efficient new algorithm we verify that the transition is discontinuous for coupled Erdos-Renyi networks, but find it to be continuous for fully interdependent diluted lattices. In 2 and 3 dimension, the order parameter exponent $β$ is larger than in ordinary percolation, showing that the transition is less sharp, i.e. further from discontinuity, than for isolated networks. Possible consequences for spatially embedded networks are discussed.
△ Less
Submitted 19 September, 2011; v1 submitted 18 August, 2011;
originally announced August 2011.
-
Explosive Percolation is Continuous, but with Unusual Finite Size Behavior
Authors:
Peter Grassberger,
Claire Christensen,
Golnoosh Bizhani,
Seung-Woo Son,
Maya Paczuski
Abstract:
We study four Achlioptas type processes with "explosive" percolation transitions. All transitions are clearly continuous, but their finite size scaling functions are not entire holomorphic. The distributions of the order parameter, the relative size $s_{\rm max}/N$ of the largest cluster, are double-humped. But -- in contrast to first order phase transitions -- the distance between the two peaks d…
▽ More
We study four Achlioptas type processes with "explosive" percolation transitions. All transitions are clearly continuous, but their finite size scaling functions are not entire holomorphic. The distributions of the order parameter, the relative size $s_{\rm max}/N$ of the largest cluster, are double-humped. But -- in contrast to first order phase transitions -- the distance between the two peaks decreases with system size $N$ as $N^{-η}$ with $η> 0$. We find different positive values of $β$ (defined via $< s_{\rm max}/N > \sim (p-p_c)^β$ for infinite systems) for each model, showing that they are all in different universality classes. In contrast, the exponent $Θ$ (defined such that observables are homogeneous functions of $(p-p_c)N^Θ$) is close to -- or even equal to -- 1/2 for all models.
△ Less
Submitted 22 March, 2011; v1 submitted 18 March, 2011;
originally announced March 2011.
-
Clustering Drives Assortativity and Community Structure in Ensembles of Networks
Authors:
David V. Foster,
Jacob G. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
Clustering, assortativity, and communities are key features of complex networks. We probe dependencies between these attributes and find that ensembles with strong clustering display both high assortativity by degree and prominent community structure, while ensembles with high assortativity are much less biased towards clustering or community structure. Further, clustered networks can amplify smal…
▽ More
Clustering, assortativity, and communities are key features of complex networks. We probe dependencies between these attributes and find that ensembles with strong clustering display both high assortativity by degree and prominent community structure, while ensembles with high assortativity are much less biased towards clustering or community structure. Further, clustered networks can amplify small homophilic bias for trait assortativity. This marked asymmetry suggests that transitivity, rather than homophily, drives the standard nonsocial/social network dichotomy.
△ Less
Submitted 5 January, 2011; v1 submitted 10 December, 2010;
originally announced December 2010.
-
Agglomerative Percolation in Two Dimensions
Authors:
Claire Christensen,
Golnoosh Bizhani,
Seung-Woo Son,
Maya Paczuski,
Peter Grassberger
Abstract:
We study a process termed "agglomerative percolation" (AP) in two dimensions. Instead of adding sites or bonds at random, in AP randomly chosen clusters are linked to all their neighbors. As a result the growth process involves a diverging length scale near a critical point. Picking target clusters with probability proportional to their mass leads to a runaway compact cluster. Choosing all cluster…
▽ More
We study a process termed "agglomerative percolation" (AP) in two dimensions. Instead of adding sites or bonds at random, in AP randomly chosen clusters are linked to all their neighbors. As a result the growth process involves a diverging length scale near a critical point. Picking target clusters with probability proportional to their mass leads to a runaway compact cluster. Choosing all clusters equally leads to a continuous transition in a new universality class for the square lattice, while the transition on the triangular lattice has the same critical exponents as ordinary percolation.
△ Less
Submitted 9 November, 2011; v1 submitted 5 December, 2010;
originally announced December 2010.
-
Irreversible Aggregation and Network Renormalization
Authors:
Seung-Woo Son,
Golnoosh Bizhani,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
Irreversible aggregation is revisited in view of recent work on renormalization of complex networks. Its scaling laws and phase transitions are related to percolation transitions seen in the latter. We illustrate our points by giving the complete solution for the probability to find any given state in an aggregation process $(k+1)X\to X$, given a fixed number of unit mass particles in the initial…
▽ More
Irreversible aggregation is revisited in view of recent work on renormalization of complex networks. Its scaling laws and phase transitions are related to percolation transitions seen in the latter. We illustrate our points by giving the complete solution for the probability to find any given state in an aggregation process $(k+1)X\to X$, given a fixed number of unit mass particles in the initial state. Exactly the same probability distributions and scaling are found in one dimensional systems (a trivial network) and well-mixed solutions. This reveals that scaling laws found in renormalization of complex networks do not prove that they are self-similar.
△ Less
Submitted 3 March, 2011; v1 submitted 3 November, 2010;
originally announced November 2010.
-
Random Sequential Renormalization of Networks I: Application to Critical Trees
Authors:
Golnoosh Bizhani,
Vishal Sood,
Maya Paczuski,
Peter Grassberger
Abstract:
We introduce the concept of Random Sequential Renormalization (RSR) for arbitrary networks. RSR is a graph renormalization procedure that locally aggregates nodes to produce a coarse grained network. It is analogous to the (quasi-)parallel renormalization schemes introduced by C. Song {\it et al.} (Nature {\bf 433}, 392 (2005)) and studied more recently by F. Radicchi {\it et al.} (Phys. Rev. Lett…
▽ More
We introduce the concept of Random Sequential Renormalization (RSR) for arbitrary networks. RSR is a graph renormalization procedure that locally aggregates nodes to produce a coarse grained network. It is analogous to the (quasi-)parallel renormalization schemes introduced by C. Song {\it et al.} (Nature {\bf 433}, 392 (2005)) and studied more recently by F. Radicchi {\it et al.} (Phys. Rev. Lett. {\bf 101}, 148701 (2008)), but much simpler and easier to implement. In this first paper we apply RSR to critical trees and derive analytical results consistent with numerical simulations. Critical trees exhibit three regimes in their evolution under RSR: (i) An initial regime $N_0^ν\lesssim N<N_0$, where $N$ is the number of nodes at some step in the renormalization and $N_0$ is the initial size. RSR in this regime is described by a mean field theory and fluctuations from one realization to another are small. The exponent $ν=1/2$ is derived using random walk arguments. The degree distribution becomes broader under successive renormalization -- reaching a power law, $p_k\sim 1/k^γ$ with $γ=2$ and a variance that diverges as $N_0^{1/2}$ at the end of this regime. Both of these results are derived based on a scaling theory. (ii) An intermediate regime for $N_0^{1/4}\lesssim N \lesssim N_0^{1/2}$, in which hubs develop, and fluctuations between different realizations of the RSR are large. Crossover functions exhibiting finite size scaling, in the critical region $N\sim N_0^{1/2} \to \infty$, connect the behaviors in the first two regimes. (iii) The last regime, for $1 \ll N\lesssim N_0^{1/4}$, is characterized by the appearance of star configurations with a central hub surrounded by many leaves. The distribution of sizes where stars first form is found numerically to be a power law up to a cutoff that scales as $N_0^{ν_{star}}$ with $ν_{star}\approx 1/4$.
△ Less
Submitted 23 March, 2011; v1 submitted 20 September, 2010;
originally announced September 2010.
-
Sequence alignment, mutual information, and dissimilarity measures for constructing phylogenies
Authors:
Orion Penner,
Peter Grassberger,
Maya Paczuski
Abstract:
Existing sequence alignment algorithms use heuristic scoring schemes which cannot be used as objective distance metrics. Therefore one relies on measures like the p- or log-det distances, or makes explicit, and often simplistic, assumptions about sequence evolution. Information theory provides an alternative, in the form of mutual information (MI) which is, in principle, an objective and model ind…
▽ More
Existing sequence alignment algorithms use heuristic scoring schemes which cannot be used as objective distance metrics. Therefore one relies on measures like the p- or log-det distances, or makes explicit, and often simplistic, assumptions about sequence evolution. Information theory provides an alternative, in the form of mutual information (MI) which is, in principle, an objective and model independent similarity measure. MI can be estimated by concatenating and zipping sequences, yielding thereby the "normalized compression distance". So far this has produced promising results, but with uncontrolled errors. We describe a simple approach to get robust estimates of MI from global pairwise alignments. Using standard alignment algorithms, this gives for animal mitochondrial DNA estimates that are strikingly close to estimates obtained from the alignment free methods mentioned above. Our main result uses algorithmic (Kolmogorov) information theory, but we show that similar results can also be obtained from Shannon theory. Due to the fact that it is not additive, normalized compression distance is not an optimal metric for phylogenetics, but we propose a simple modification that overcomes the issue of additivity. We test several versions of our MI based distance measures on a large number of randomly chosen quartets and demonstrate that they all perform better than traditional measures like the Kimura or log-det (resp. paralinear) distances. Even a simplified version based on single letter Shannon entropies, which can be easily incorporated in existing software packages, gave superior results throughout the entire animal kingdom. But we see the main virtue of our approach in a more general way. For example, it can also help to judge the relative merits of different alignment algorithms, by estimating the significance of specific alignments.
△ Less
Submitted 19 August, 2010;
originally announced August 2010.
-
The Interacting Branching Process as a Simple Model of Innovation
Authors:
Vishal Sood,
Myléne Mathieu,
Amer Shreim,
Peter Grassberger,
Maya Paczuski
Abstract:
We describe innovation in terms of a generalized branching process. Each new invention pairs with any existing one to produce a number of offspring, which is Poisson distributed with mean p. Existing inventions die with probability p/τat each generation. In contrast to mean field results, no phase transition occurs; the chance for survival is finite for all p > 0. For τ= \infty, surviving processe…
▽ More
We describe innovation in terms of a generalized branching process. Each new invention pairs with any existing one to produce a number of offspring, which is Poisson distributed with mean p. Existing inventions die with probability p/τat each generation. In contrast to mean field results, no phase transition occurs; the chance for survival is finite for all p > 0. For τ= \infty, surviving processes exhibit a bottleneck before exploding super-exponentially - a growth consistent with a law of accelerating returns. This behavior persists for finite τ. We analyze, in detail, the asymptotic behavior as p \to 0.
△ Less
Submitted 17 September, 2010; v1 submitted 30 March, 2010;
originally announced March 2010.
-
Attractor and Basin Entropies of Random Boolean Networks Under Asynchronous Stochastic Update
Authors:
Amer Shreim,
Andrew Berdahl,
Florian Greil,
Jörn Davidsen,
Maya Paczuski
Abstract:
We introduce a numerical method to study random Boolean networks with asynchronous stochas- tic update. Each node in the network of states starts with equal occupation probability and this probability distribution then evolves to a steady state. Nodes left with finite occupation probability determine the attractors and the sizes of their basins. As for synchronous update, the basin entropy grows w…
▽ More
We introduce a numerical method to study random Boolean networks with asynchronous stochas- tic update. Each node in the network of states starts with equal occupation probability and this probability distribution then evolves to a steady state. Nodes left with finite occupation probability determine the attractors and the sizes of their basins. As for synchronous update, the basin entropy grows with system size only for critical networks, where the distribution of attractor lengths is a power law. We determine analytically the distribution for the number of attractors and basin sizes for frozen networks with connectivity K = 1.
△ Less
Submitted 10 March, 2010;
originally announced March 2010.
-
Clustering Phase Transitions and Hysteresis: Pitfalls in Constructing Network Ensembles
Authors:
David V. Foster,
Jacob G. Foster,
Maya Paczuski,
Peter Grassberger
Abstract:
Ensembles of networks are used as null models in many applications. However, simple null models often show much less clustering than their real-world counterparts. In this paper, we study a model where clustering is enhanced by means of a fugacity term as in the Strauss (or "triangle") model, but where the degree sequence is strictly preserved -- thus maintaining the quenched heterogeneity of no…
▽ More
Ensembles of networks are used as null models in many applications. However, simple null models often show much less clustering than their real-world counterparts. In this paper, we study a model where clustering is enhanced by means of a fugacity term as in the Strauss (or "triangle") model, but where the degree sequence is strictly preserved -- thus maintaining the quenched heterogeneity of nodes found in the original degree sequence. Similar models had been proposed previously in [R. Milo et al., Science 298, 824 (2002)]. We find that our model exhibits phase transitions as the fugacity is changed. For regular graphs (identical degrees for all nodes) with degree k > 2 we find a single first order transition. For all non-regular networks that we studied (including Erdos - Renyi and scale-free networks) we find multiple jumps resembling first order transitions, together with strong hysteresis. The latter transitions are driven by the sudden emergence of "cluster cores": groups of highly interconnected nodes with higher than average degrees. To study these cluster cores visually, we introduce q-clique adjacency plots. We find that these cluster cores constitute distinct communities which emerge spontaneously from the triangle generating process. Finally, we point out that cluster cores produce pitfalls when using the present (and similar) models as null models for strongly clustered networks, due to the very strong hysteresis which effectively leads to broken ergodicity on realistic time scales.
△ Less
Submitted 11 November, 2009;
originally announced November 2009.
-
Activity Dependent Branching Ratios in Stocks, Solar X-ray Flux, and the Bak-Tang-Wiesenfeld Sandpile Model
Authors:
Elliot Martin,
Amer Shreim,
Maya Paczuski
Abstract:
We define an activity dependent branching ratio that allows comparison of different time series $X_{t}$. The branching ratio $b_x$ is defined as $b_x= E[ξ_x/x]$. The random variable $ξ_x$ is the value of the next signal given that the previous one is equal to $x$, so $ξ_x=\{X_{t+1}|X_t=x\}$. If $b_x>1$, the process is on average supercritical when the signal is equal to $x$, while if $b_x<1$, it…
▽ More
We define an activity dependent branching ratio that allows comparison of different time series $X_{t}$. The branching ratio $b_x$ is defined as $b_x= E[ξ_x/x]$. The random variable $ξ_x$ is the value of the next signal given that the previous one is equal to $x$, so $ξ_x=\{X_{t+1}|X_t=x\}$. If $b_x>1$, the process is on average supercritical when the signal is equal to $x$, while if $b_x<1$, it is subcritical. For stock prices we find $b_x=1$ within statistical uncertainty, for all $x$, consistent with an ``efficient market hypothesis''. For stock volumes, solar X-ray flux intensities, and the Bak-Tang-Wiesenfeld (BTW) sandpile model, $b_x$ is supercritical for small values of activity and subcritical for the largest ones, indicating a tendency to return to a typical value. For stock volumes this tendency has an approximate power law behavior. For solar X-ray flux and the BTW model, there is a broad regime of activity where $b_x \simeq 1$, which we interpret as an indicator of critical behavior. This is true despite different underlying probability distributions for $X_t$, and for $ξ_x$. For the BTW model the distribution of $ξ_x$ is Gaussian, for $x$ sufficiently larger than one, and its variance grows linearly with $x$. Hence, the activity in the BTW model obeys a central limit theorem when sampling over past histories. The broad region of activity where $b_x$ is close to one disappears once bulk dissipation is introduced in the BTW model -- supporting our hypothesis that it is an indicator of criticality.
△ Less
Submitted 13 October, 2009;
originally announced October 2009.
-
Edge direction and the structure of networks
Authors:
Jacob G. Foster,
David V. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
Directed networks are ubiquitous and are necessary to represent complex systems with asymmetric interactions---from food webs to the World Wide Web. Despite the importance of edge direction for detecting local and community structure, it has been disregarded in studying a basic type of global diversity in networks: the tendency of nodes with similar numbers of edges to connect. This tendency, call…
▽ More
Directed networks are ubiquitous and are necessary to represent complex systems with asymmetric interactions---from food webs to the World Wide Web. Despite the importance of edge direction for detecting local and community structure, it has been disregarded in studying a basic type of global diversity in networks: the tendency of nodes with similar numbers of edges to connect. This tendency, called assortativity, affects crucial structural and dynamic properties of real-world networks, such as error tolerance or epidemic spreading. Here we demonstrate that edge direction has profound effects on assortativity. We define a set of four directed assortativity measures and assign statistical significance by comparison to randomized networks. We apply these measures to three network classes---online/social networks, food webs, and word-adjacency networks. Our measures (i) reveal patterns common to each class, (ii) separate networks that have been previously classified together, and (iii) expose limitations of several existing theoretical models. We reject the standard classification of directed networks as purely assortative or disassortative. Many display a class-specific mixture, likely reflecting functional or historical constraints, contingencies, and forces guiding the system's evolution.
△ Less
Submitted 7 November, 2010; v1 submitted 28 August, 2009;
originally announced August 2009.
-
Random sampling vs. exact enumeration of attractors in random Boolean networks
Authors:
Andrew Berdahl,
Amer Shreim,
Vishal Sood,
Maya Paczuski,
Joern Davidsen
Abstract:
We clarify the effect different sampling methods and weighting schemes have on the statistics of attractors in ensembles of random Boolean networks (RBNs). We directly measure cycle lengths of attractors and sizes of basins of attraction in RBNs using exact enumeration of the state space. In general, the distribution of attractor lengths differs markedly from that obtained by randomly choosing a…
▽ More
We clarify the effect different sampling methods and weighting schemes have on the statistics of attractors in ensembles of random Boolean networks (RBNs). We directly measure cycle lengths of attractors and sizes of basins of attraction in RBNs using exact enumeration of the state space. In general, the distribution of attractor lengths differs markedly from that obtained by randomly choosing an initial state and following the dynamics to reach an attractor. Our results indicate that the former distribution decays as a power-law with exponent 1 for all connectivities $K>1$ in the infinite system size limit. In contrast, the latter distribution decays as a power law only for K=2. This is because the mean basin size grows linearly with the attractor cycle length for $K>2$, and is statistically independent of the cycle length for K=2. We also find that the histograms of basin sizes are strongly peaked at integer multiples of powers of two for $K<3$.
△ Less
Submitted 24 April, 2009;
originally announced April 2009.
-
Sequence alignment and mutual information
Authors:
Orion Penner,
Peter Grassberger,
Maya Paczuski
Abstract:
Background: Alignment of biological sequences such as DNA, RNA or proteins is one of the most widely used tools in computational bioscience. All existing alignment algorithms rely on heuristic scoring schemes based on biological expertise. Therefore, these algorithms do not provide model independent and objective measures for how similar two (or more) sequences actually are. Although information…
▽ More
Background: Alignment of biological sequences such as DNA, RNA or proteins is one of the most widely used tools in computational bioscience. All existing alignment algorithms rely on heuristic scoring schemes based on biological expertise. Therefore, these algorithms do not provide model independent and objective measures for how similar two (or more) sequences actually are. Although information theory provides such a similarity measure -- the mutual information (MI) -- previous attempts to connect sequence alignment and information theory have not produced realistic estimates for the MI from a given alignment.
Results: Here we describe a simple and flexible approach to get robust estimates of MI from {\it global} alignments. For mammalian mitochondrial DNA, our approach gives pairwise MI estimates for commonly used global alignment algorithms that are strikingly close to estimates obtained by an entirely unrelated approach -- concatenating and zipping the sequences.
Conclusions: This remarkable consistency may help establish MI as a reliable tool for evaluating the quality of global alignments, judging the relative merits of different alignment algorithms, and estimating the significance of specific alignments. We expect that our approach can be extended to establish further connections between information theory and sequence alignment, including applications to local and multiple alignment procedures.
△ Less
Submitted 23 October, 2008;
originally announced October 2008.
-
Reinforced walks in two and three dimensions
Authors:
Jacob G. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
In probability theory, reinforced walks are random walks on a lattice (or more generally a graph) that preferentially revisit neighboring `locations' (sites or bonds) that have been visited before. In this paper, we consider walks with one-step reinforcement, where one preferentially \emph{revisits} locations irrespective of the number of visits. Previous numerical simulations [A. Ordemann {\it…
▽ More
In probability theory, reinforced walks are random walks on a lattice (or more generally a graph) that preferentially revisit neighboring `locations' (sites or bonds) that have been visited before. In this paper, we consider walks with one-step reinforcement, where one preferentially \emph{revisits} locations irrespective of the number of visits. Previous numerical simulations [A. Ordemann {\it et al.}, Phys. Rev. E {\bf 64}, 046117 (2001)] suggested that the site model on the lattice shows a phase transition at finite reinforcement between a random-walk like and a collapsed phase, in both 2 and 3 dimensions. The very different mathematical structure of bond and site models might also suggest different phenomenology (critical properties, etc.). We use high statistics simulations and heuristic arguments to suggest that site and bond reinforcement are in the same universality class, and that the purported phase transition in 2 dimensions actually occurs at zero coupling constant. We also show that a quasi-static approximation predicts the large time scaling of the end-to-end distance in the collapsed phase of both site and bond reinforcement models, in excellent agreement with simulation results.
△ Less
Submitted 8 July, 2008;
originally announced July 2008.
-
Reply to Comment on ``Analysis of the spatial distribution between successive earthquakes''
Authors:
J. Davidsen,
M. Paczuski
Abstract:
This is a reply to the Comment on ``Analysis of the spatial distribution between successive earthquakes'' by Maximilian Jonas Werner and Didier Sornette.
This is a reply to the Comment on ``Analysis of the spatial distribution between successive earthquakes'' by Maximilian Jonas Werner and Didier Sornette.
△ Less
Submitted 6 May, 2008;
originally announced May 2008.
-
Avalanches, branching ratios, and clustering of attractors in Random Boolean Networks and in the segment polarity network of \emph{Drosophila}
Authors:
Andrew Berdahl,
Amer Shreim,
Vishal Sood,
Joern Davidsen,
Maya Paczuski
Abstract:
We discuss basic features of emergent complexity in dynamical systems far from equilibrium by focusing on the network structure of their state space. We start by measuring the distributions of avalanche and transient times in Random Boolean Networks (RBNs) and in the \emph{Drosophila} polarity network by exact enumeration. A transient time is the duration of the transient from a starting state t…
▽ More
We discuss basic features of emergent complexity in dynamical systems far from equilibrium by focusing on the network structure of their state space. We start by measuring the distributions of avalanche and transient times in Random Boolean Networks (RBNs) and in the \emph{Drosophila} polarity network by exact enumeration. A transient time is the duration of the transient from a starting state to an attractor. An avalanche is a special transient which starts as single Boolean element perturbation of an attractor state. Significant differences at short times between the avalanche and the transient times for RBNs with small connectivity $K$ -- compared to the number of elements $N$ -- indicate that attractors tend to cluster in configuration space. In addition, one bit flip has a non-negligible chance to put an attractor state directly onto another attractor. This clustering is also present in the segment polarity gene network of \emph{Drosophila melanogaster}, suggesting that this may be a robust feature of biological regulatory networks. We also define and measure a branching ratio for the state space networks and find evidence for a new time scale that diverges roughly linearly with $N$ for $2\leq K \ll N$. Analytic arguments show that this time scale does not appear in the random map nor can the random map exhibit clustering of attractors. We further show that for K=2 the branching ratio exhibits the largest variation with distance from the attractor compared to other values of $K$ and that the avalanche durations exhibit no characteristic scale within our statistical resolution. Hence, we propose that the branching ratio and the avalanche duration are new indicators for scale-free behavior that may or may not be found simultaneously with other indicators of emergent complexity in extended, deterministic dynamical systems.
△ Less
Submitted 2 May, 2008;
originally announced May 2008.
-
Complex Network Analysis of State Spaces for Random Boolean Networks
Authors:
Amer Shreim,
Andrew Berdahl,
Vishal Sood,
Peter Grassberger,
Maya Paczuski
Abstract:
We apply complex network analysis to the state spaces of random Boolean networks (RBNs). An RBN contains $N$ Boolean elements each with $K$ inputs. A directed state space network (SSN) is constructed by linking each dynamical state, represented as a node, to its temporal successor. We study the heterogeneity of an SSN at both local and global scales, as well as sample-to-sample fluctuations with…
▽ More
We apply complex network analysis to the state spaces of random Boolean networks (RBNs). An RBN contains $N$ Boolean elements each with $K$ inputs. A directed state space network (SSN) is constructed by linking each dynamical state, represented as a node, to its temporal successor. We study the heterogeneity of an SSN at both local and global scales, as well as sample-to-sample fluctuations within an ensemble of SSNs. We use in-degrees of nodes as a local topological measure, and the path diversity [Phys. Rev. Lett. 98, 198701 (2007)] of an SSN as a global topological measure. RBNs with $2 \leq K \leq 5$ exhibit non-trivial fluctuations at both local and global scales, while K=2 exhibits the largest sample-to-sample, possibly non-self-averaging, fluctuations. We interpret the observed ``multi scale'' fluctuations in the SSNs as indicative of the criticality and complexity of K=2 RBNs. ``Garden of Eden'' (GoE) states are nodes on an SSN that have in-degree zero. While in-degrees of non-GoE nodes for $K>1$ SSNs can assume any integer value between 0 and $2^N$, for K=1 all the non-GoE nodes in an SSN have the same in-degree which is always a power of two.
△ Less
Submitted 28 November, 2007; v1 submitted 2 October, 2007;
originally announced October 2007.
-
Node similarity within subgraphs of protein interaction networks
Authors:
Orion Penner,
Vishal Sood,
Gabe Musso,
Kim Baskerville,
Peter Grassberger,
Maya Paczuski
Abstract:
We propose a biologically motivated quantity, twinness, to evaluate local similarity between nodes in a network. The twinness of a pair of nodes is the number of connected, labeled subgraphs of size n in which the two nodes possess identical neighbours. The graph animal algorithm is used to estimate twinness for each pair of nodes (for subgraph sizes n=4 to n=12) in four different protein intera…
▽ More
We propose a biologically motivated quantity, twinness, to evaluate local similarity between nodes in a network. The twinness of a pair of nodes is the number of connected, labeled subgraphs of size n in which the two nodes possess identical neighbours. The graph animal algorithm is used to estimate twinness for each pair of nodes (for subgraph sizes n=4 to n=12) in four different protein interaction networks (PINs). These include an Escherichia coli PIN and three Saccharomyces cerevisiae PINs -- each obtained using state-of-the-art high throughput methods. In almost all cases, the average twinness of node pairs is vastly higher than expected from a null model obtained by switching links. For all n, we observe a difference in the ratio of type A twins (which are unlinked pairs) to type B twins (which are linked pairs) distinguishing the prokaryote E. coli from the eukaryote S. cerevisiae. Interaction similarity is expected due to gene duplication, and whole genome duplication paralogues in S. cerevisiae have been reported to co-cluster into the same complexes. Indeed, we find that these paralogous proteins are over-represented as twins compared to pairs chosen at random. These results indicate that twinness can detect ancestral relationships from currently available PIN data.
△ Less
Submitted 17 August, 2007; v1 submitted 13 July, 2007;
originally announced July 2007.
-
Graph animals, subgraph sampling and motif search in large networks
Authors:
Kim Baskerville,
Peter Grassberger,
Maya Paczuski
Abstract:
We generalize a sampling algorithm for lattice animals (connected clusters on a regular lattice) to a Monte Carlo algorithm for `graph animals', i.e. connected subgraphs in arbitrary networks. As with the algorithm in [N. Kashtan et al., Bioinformatics 20, 1746 (2004)], it provides a weighted sample, but the computation of the weights is much faster (linear in the size of subgraphs, instead of s…
▽ More
We generalize a sampling algorithm for lattice animals (connected clusters on a regular lattice) to a Monte Carlo algorithm for `graph animals', i.e. connected subgraphs in arbitrary networks. As with the algorithm in [N. Kashtan et al., Bioinformatics 20, 1746 (2004)], it provides a weighted sample, but the computation of the weights is much faster (linear in the size of subgraphs, instead of super-exponential). This allows subgraphs with up to ten or more nodes to be sampled with very high statistics, from arbitrarily large networks. Using this together with a heuristic algorithm for rapidly classifying isomorphic graphs, we present results for two protein interaction networks obtained using the TAP high throughput method: one of Escherichia coli with 230 nodes and 695 links, and one for yeast (Saccharomyces cerevisiae) with roughly ten times more nodes and links. We find in both cases that most connected subgraphs are strong motifs (Z-scores >10) or anti-motifs (Z-scores <-10) when the null model is the ensemble of networks with fixed degree sequence. Strong differences appear between the two networks, with dominant motifs in E. coli being (nearly) bipartite graphs and having many pairs of nodes which connect to the same neighbors, while dominant motifs in yeast tend towards completeness or contain large cliques. We also explore a number of methods that do not rely on measurements of Z-scores or comparisons with null models. For instance, we discuss the influence of specific complexes like the 26S proteasome in yeast, where a small number of complexes dominate the $k$-cores with large k and have a decisive effect on the strongest motifs with 6 to 8 nodes. We also present Zipf plots of counts versus rank. They show broad distributions that are not power laws, in contrast to the case when disconnected subgraphs are included.
△ Less
Submitted 22 June, 2007; v1 submitted 13 February, 2007;
originally announced February 2007.
-
Networks of Recurrent Events, a Theory of Records, and an Application to Finding Causal Signatures in Seismicity
Authors:
J. Davidsen,
P. Grassberger,
M. Paczuski
Abstract:
We propose a method to search for signs of causal structure in spatiotemporal data making minimal a priori assumptions about the underlying dynamics. To this end, we generalize the elementary concept of recurrence for a point process in time to recurrent events in space and time. An event is defined to be a recurrence of any previous event if it is closer to it in space than all the intervening…
▽ More
We propose a method to search for signs of causal structure in spatiotemporal data making minimal a priori assumptions about the underlying dynamics. To this end, we generalize the elementary concept of recurrence for a point process in time to recurrent events in space and time. An event is defined to be a recurrence of any previous event if it is closer to it in space than all the intervening events. As such, each sequence of recurrences for a given event is a record breaking process. This definition provides a strictly data driven technique to search for structure. Defining events to be nodes, and linking each event to its recurrences, generates a network of recurrent events. Significant deviations in properties of that network compared to networks arising from random processes allows one to infer attributes of the causal dynamics that generate observable correlations in the patterns. We derive analytically a number of properties for the network of recurrent events composed by a random process. We extend the theory of records to treat not only the variable where records happen, but also time as continuous. In this way, we construct a fully symmetric theory of records leading to a number of new results. Those analytic results are compared to the properties of a network synthesized from earthquakes in Southern California. Significant disparities from the ensemble of acausal networks that can be plausibly attributed to the causal structure of seismicity are: (1) Invariance of network statistics with the time span of the events considered, (2) Appearance of a fundamental length scale for recurrences, independent of the time span of the catalog, which is consistent with observations of the ``rupture length'', (3) Hierarchy in the distances and times of subsequent recurrences.
△ Less
Submitted 29 April, 2008; v1 submitted 16 January, 2007;
originally announced January 2007.
-
Self-Organized Criticality and Intermittent Turbulence in an MHD Current Sheet with a Threshold Instability
Authors:
Alexander J. Klimas,
Vadim M. Uritsky,
Maya Paczuski
Abstract:
We report numerical evidence of a self-organized criticality (SOC) and intermittent turbulence (IT) symbiosis in a resistive magnetohydrodynamic (MHD) current sheet model that includes a local hysteretic switch to capture plasma physical processes outside of MHD that are described in the model as current-dependent resistivity. Results from numerical simulations show scale-free avalanches of magn…
▽ More
We report numerical evidence of a self-organized criticality (SOC) and intermittent turbulence (IT) symbiosis in a resistive magnetohydrodynamic (MHD) current sheet model that includes a local hysteretic switch to capture plasma physical processes outside of MHD that are described in the model as current-dependent resistivity. Results from numerical simulations show scale-free avalanches of magnetic energy dissipation characteristic of SOC, as well as multi-scaling in the velocity field numerically indistinguishable from certain hierarchical turbulence theories. We suggest that SOC and IT may be complementary descriptions of dynamical states realized by driven current sheets -- which occur ubiquitously in astrophysical and space plasmas.
△ Less
Submitted 8 January, 2009; v1 submitted 16 January, 2007;
originally announced January 2007.
-
Network Analysis of the State Space of Discrete Dynamical Systems
Authors:
Amer Shreim,
Peter Grassberger,
Walter Nadler,
Björn Samuelsson,
Joshua E. S. Socolar,
Maya Paczuski
Abstract:
We study networks representing the dynamics of elementary 1-d cellular automata (CA) on finite lattices. We analyze scaling behaviors of both local and global network properties as a function of system size. The scaling of the largest node in-degree is obtained analytically for a variety of CA including rules 22, 54 and 110. We further define the \emph{path diversity} as a global network measure…
▽ More
We study networks representing the dynamics of elementary 1-d cellular automata (CA) on finite lattices. We analyze scaling behaviors of both local and global network properties as a function of system size. The scaling of the largest node in-degree is obtained analytically for a variety of CA including rules 22, 54 and 110. We further define the \emph{path diversity} as a global network measure. The co-appearance of non-trivial scaling in both hub size and path diversity separates simple dynamics from the more complex behaviors typically found in Wolfram's Class IV and some Class III CA.
△ Less
Submitted 6 March, 2007; v1 submitted 16 October, 2006;
originally announced October 2006.
-
Link and subgraph likelihoods in random undirected networks with fixed and partially fixed degree sequence
Authors:
Jacob G. Foster,
David V. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
The simplest null models for networks, used to distinguish significant features of a particular network from {\it a priori} expected features, are random ensembles with the degree sequence fixed by the specific network of interest. These "fixed degree sequence" (FDS) ensembles are, however, famously resistant to analytic attack. In this paper we introduce ensembles with partially-fixed degree se…
▽ More
The simplest null models for networks, used to distinguish significant features of a particular network from {\it a priori} expected features, are random ensembles with the degree sequence fixed by the specific network of interest. These "fixed degree sequence" (FDS) ensembles are, however, famously resistant to analytic attack. In this paper we introduce ensembles with partially-fixed degree sequences (PFDS) and compare analytic results obtained for them with Monte Carlo results for the FDS ensemble. These results include link likelihoods, subgraph likelihoods, and degree correlations. We find that local structural features in the FDS ensemble can be reasonably well estimated by simultaneously fixing only the degrees of few nodes, in addition to the total number of nodes and links. As test cases we use a food web, two protein interaction networks (\textit{E. coli, S. cerevisiae}), the internet on the autonomous system (AS) level, and the World Wide Web. Fixing just the degrees of two nodes gives the mean neighbor degree as a function of node degree, $<k'>_k$, in agreement with results explicitly obtained from rewiring. For power law degree distributions, we derive the disassortativity analytically. In the PFDS ensemble the partition function can be expanded diagrammatically. We obtain an explicit expression for the link likelihood to lowest order, which reduces in the limit of large, sparse undirected networks with $L$ links and with $k_{\rm max} \ll L$ to the simple formula $P(k,k') = kk'/(2L + kk')$. In a similar limit, the probability for three nodes to be linked into a triangle reduces to the factorized expression $P_Δ(k_1,k_2,k_3) = P(k_1,k_2)P(k_1,k_3)P(k_2,k_3)$.
△ Less
Submitted 16 June, 2007; v1 submitted 16 October, 2006;
originally announced October 2006.
-
Coexistence of Self-Organized Criticality and Intermittent Turbulence in the Solar Corona
Authors:
Vadim M. Uritsky,
Maya Paczuski,
Joseph M. Davila,
Shaela I. Jones
Abstract:
An extended data set of extreme ultraviolet images of the solar corona provided by the SOHO spacecraft are analyzed using statistical methods common to studies of self-organized criticality (SOC) and intermittent turbulence (IT). The data exhibits simultaneous hallmarks of both regimes, namely power law avalanche statistics as well as multiscaling of structure functions for spatial activity. Thi…
▽ More
An extended data set of extreme ultraviolet images of the solar corona provided by the SOHO spacecraft are analyzed using statistical methods common to studies of self-organized criticality (SOC) and intermittent turbulence (IT). The data exhibits simultaneous hallmarks of both regimes, namely power law avalanche statistics as well as multiscaling of structure functions for spatial activity. This implies that both SOC and IT may be manifestations of a single complex dynamical process entangling avalanches of magnetic energy dissipation with turbulent particle flows.
△ Less
Submitted 25 May, 2007; v1 submitted 4 October, 2006;
originally announced October 2006.
-
Subgraph Ensembles and Motif Discovery Using a New Heuristic for Graph Isomorphism
Authors:
Kim Baskerville,
Maya Paczuski
Abstract:
A new heuristic based on vertex invariants is developed to rapidly distinguish non-isomorphic graphs to a desired level of accuracy. The method is applied to sample subgraphs from an E.coli protein interaction network, and as a probe for discovery of extended motifs. The network's structure is described using statistical properties of its $N$-node subgraphs for $N\leq 14$. The Zipf plots for sub…
▽ More
A new heuristic based on vertex invariants is developed to rapidly distinguish non-isomorphic graphs to a desired level of accuracy. The method is applied to sample subgraphs from an E.coli protein interaction network, and as a probe for discovery of extended motifs. The network's structure is described using statistical properties of its $N$-node subgraphs for $N\leq 14$. The Zipf plots for subgraph occurrences are robust power laws that do not change when rewiring the network while fixing the degree sequence -- although the specific subgraphs may exchange ranks. However the exponent depends on $N$. The study of larger subgraphs highlights some striking patterns for various $N$. Motifs, or connected pieces that are over-abundant in the ensemble of subgraphs, have more edges, for a given number of nodes, than antimotifs and generally display a bipartite structure or tend towards a complete graph. In contrast, antimotifs, which are under-abundant connected pieces, are mostly trees or contain at most a single, small loop. The extension to directed graphs is straightforward.
△ Less
Submitted 19 June, 2006; v1 submitted 19 June, 2006;
originally announced June 2006.
-
Earthquake recurrence as a record breaking process
Authors:
Joern Davidsen,
Peter Grassberger,
Maya Paczuski
Abstract:
Extending the central concept of recurrence times for a point process to recurrent events in space-time allows us to characterize seismicity as a record breaking process using only spatiotemporal relations among events. Linking record breaking events with edges between nodes in a graph generates a complex dynamical network isolated from any length, time or magnitude scales set by the observer. F…
▽ More
Extending the central concept of recurrence times for a point process to recurrent events in space-time allows us to characterize seismicity as a record breaking process using only spatiotemporal relations among events. Linking record breaking events with edges between nodes in a graph generates a complex dynamical network isolated from any length, time or magnitude scales set by the observer. For Southern California, the network of recurrences reveals new statistical features of seismicity with robust scaling laws. The rupture length and its scaling with magnitude emerges as a generic measure for distance between recurrent events. Further, the relative separations for subsequent records in space (or time) form a hierarchy with unexpected scaling properties.
△ Less
Submitted 28 June, 2006; v1 submitted 11 July, 2005;
originally announced July 2005.
-
Inter-occurrence Times in the Bak-Tang-Wiesenfeld Sandpile Model: A Comparison with the Turbulent Statistics of Solar Flares
Authors:
Maya Paczuski,
Stefan Boettcher,
Marco Baiesi
Abstract:
A sequence of bursts observed in an intermittent time series may be caused by a single avalanche, even though these bursts appear as distinct events when noise and/or instrument resolution impose a detection threshold. In the Bak-Tang-Wiesenfeld sandpile, the statistics of quiet times between bursts switches from Poissonian to scale invariant on raising the threshold for detecting instantaneous…
▽ More
A sequence of bursts observed in an intermittent time series may be caused by a single avalanche, even though these bursts appear as distinct events when noise and/or instrument resolution impose a detection threshold. In the Bak-Tang-Wiesenfeld sandpile, the statistics of quiet times between bursts switches from Poissonian to scale invariant on raising the threshold for detecting instantaneous activity, since each zero-threshold avalanche breaks into a hierarchy of correlated bursts. Calibrating the model with the time resolution of GOES data, qualitative agreement with the inter-occurrence time statistics of solar flares at different intensity thresholds is found.
△ Less
Submitted 20 June, 2005;
originally announced June 2005.
-
Networks as Renormalized Models for Emergent Behavior in Physical Systems
Authors:
Maya Paczuski
Abstract:
Networks are paradigms for describing complex biological, social and technological systems. Here I argue that networks provide a coherent framework to construct coarse-grained models for many different physical systems. To elucidate these ideas, I discuss two long-standing problems. The first concerns the structure and dynamics of magnetic fields in the solar corona, as exemplified by sunspots t…
▽ More
Networks are paradigms for describing complex biological, social and technological systems. Here I argue that networks provide a coherent framework to construct coarse-grained models for many different physical systems. To elucidate these ideas, I discuss two long-standing problems. The first concerns the structure and dynamics of magnetic fields in the solar corona, as exemplified by sunspots that startled Galileo almost 400 years ago. We discovered that the magnetic structure of the corona embodies a scale free network, with spots at all scales. A network model representing the three-dimensional geometry of magnetic fields, where links rewire and nodes merge when they collide in space, gives quantitative agreement with available data, and suggests new measurements. Seismicity is addressed in terms of relations between events without imposing space-time windows. A metric estimates the correlation between any two earthquakes. Linking strongly correlated pairs, and ignoring pairs with weak correlation organizes the spatio-temporal process into a sparse, directed, weighted network. New scaling laws for seismicity are found. For instance, the aftershock decay rate decreases as 1/t in time up to a correlation time, t[omori]. An estimate from the data gives t[omori] to be about one year for small magnitude 3 earthquakes, about 1400 years for the Landers event, and roughly 26,000 years for the earthquake causing the 2004 Asian tsunami. Our results confirm Kagan's conjecture that aftershocks can rumble on for centuries.
△ Less
Submitted 7 February, 2005;
originally announced February 2005.
-
Correlated dynamics in human printing behavior
Authors:
Uli Harder,
Maya Paczuski
Abstract:
Arrival times of requests to print in a student laboratory were analyzed. Inter-arrival times between subsequent requests follow a universal scaling law relating time intervals and the size of the request, indicating a scale invariant dynamics with respect to the size. The cumulative distribution of file sizes is well-described by a modified power law often seen in non-equilibrium critical syste…
▽ More
Arrival times of requests to print in a student laboratory were analyzed. Inter-arrival times between subsequent requests follow a universal scaling law relating time intervals and the size of the request, indicating a scale invariant dynamics with respect to the size. The cumulative distribution of file sizes is well-described by a modified power law often seen in non-equilibrium critical systems. For each user, waiting times between their individual requests show long range dependence and are broadly distributed from seconds to weeks. All results are incompatible with Poisson models, and may provide evidence of critical dynamics associated with voluntary thought processes in the brain.
△ Less
Submitted 7 December, 2004;
originally announced December 2004.
-
Intensity Thresholds and the Statistics of the Temporal Occurrence of Solar Flares
Authors:
Marco Baiesi,
Maya Paczuski,
Attilio L. Stella
Abstract:
Introducing thresholds to analyze time series of emission from the Sun enables a new and simple definition of solar flare events, and their interoccurrence times. Rescaling time by the rate of events, the waiting and quiet time distributions both conform to scaling functions that are independent of the intensity threshold over a wide range. The scaling functions are well described by a two param…
▽ More
Introducing thresholds to analyze time series of emission from the Sun enables a new and simple definition of solar flare events, and their interoccurrence times. Rescaling time by the rate of events, the waiting and quiet time distributions both conform to scaling functions that are independent of the intensity threshold over a wide range. The scaling functions are well described by a two parameter function, with parameters that depend on the phase of the solar cycle. For flares identified according to the current, standard definition, similar behavior is found.
△ Less
Submitted 17 November, 2005; v1 submitted 12 November, 2004;
originally announced November 2004.
-
How far away is the next earthquake?
Authors:
Jörn Davidsen,
Maya Paczuski
Abstract:
Spatial distances between subsequent earthquakes in southern California exhibit scale-free statistics, with a critical exponent $δ\approx 0.6$, as well as finite size scaling. The statistics are independent of the threshold magnitude as long as the catalog is complete, but depend strongly on the temporal ordering of events, rather than the geometry of the spatial epicenter distribution. Neverthe…
▽ More
Spatial distances between subsequent earthquakes in southern California exhibit scale-free statistics, with a critical exponent $δ\approx 0.6$, as well as finite size scaling. The statistics are independent of the threshold magnitude as long as the catalog is complete, but depend strongly on the temporal ordering of events, rather than the geometry of the spatial epicenter distribution. Nevertheless, the spatial distance and waiting time between subsequent earthquakes are uncorrelated with each other. These observations contradict the theory of aftershock zone scaling with main shock magnitude.
△ Less
Submitted 11 November, 2004;
originally announced November 2004.
-
A dynamical model of a GRID market
Authors:
Uli Harder,
Peter Harrison,
Maya Paczuski,
Tejas Shah
Abstract:
We discuss potential market mechanisms for the GRID. A complete dynamical model of a GRID market is defined with three types of agents. Providers, middlemen and users exchange universal GRID computing units (GCUs) at varying prices. Providers and middlemen have strategies aimed at maximizing profit while users are 'satisficing' agents, and only change their behavior if the service they receive i…
▽ More
We discuss potential market mechanisms for the GRID. A complete dynamical model of a GRID market is defined with three types of agents. Providers, middlemen and users exchange universal GRID computing units (GCUs) at varying prices. Providers and middlemen have strategies aimed at maximizing profit while users are 'satisficing' agents, and only change their behavior if the service they receive is sufficiently poor or overpriced. Preliminary results from a multi-agent numerical simulation of the market model shows that the distribution of price changes has a power law tail.
△ Less
Submitted 2 October, 2004;
originally announced October 2004.
-
Complex networks of earthquakes and aftershocks
Authors:
Marco Baiesi,
Maya Paczuski
Abstract:
We invoke a metric to quantify the correlation between any two earthquakes. This provides a simple and straightforward alternative to using space-time windows to detect aftershock sequences and obviates the need to distinguish main shocks from aftershocks. Directed networks of earthquakes are constructed by placing a link, directed from the past to the future, between pairs of events that are st…
▽ More
We invoke a metric to quantify the correlation between any two earthquakes. This provides a simple and straightforward alternative to using space-time windows to detect aftershock sequences and obviates the need to distinguish main shocks from aftershocks. Directed networks of earthquakes are constructed by placing a link, directed from the past to the future, between pairs of events that are strongly correlated. Each link has a weight giving the relative strength of correlation such that the sum over the incoming links to any node equals unity for aftershocks, or zero if the event had no correlated predecessors. A correlation threshold is set to drastically reduce the size of the data set without losing significant information. Events can be aftershocks of many previous events, and also generate many aftershocks. The probability distribution for the number of incoming and outgoing links are both scale free, and the networks are highly clustered. The Omori law holds for aftershock rates up to a decorrelation time that scales with the magnitude, $m$, of the initiating shock as $t_{\rm cutoff} \sim 10^{βm}$ with $β\simeq 3/4$. Another scaling law relates distances between earthquakes and their aftershocks to the magnitude of the initiating shock. Our results are inconsistent with the hypothesis of finite aftershock zones. We also find evidence that seismicity is dominantly triggered by small earthquakes. Our approach, using concepts from the modern theory of complex networks, together with a metric to estimate correlations, opens up new avenues of research, as well as new tools to understand seismicity.
△ Less
Submitted 21 December, 2004; v1 submitted 4 August, 2004;
originally announced August 2004.
-
Scaling law for seismic hazard after a main shock
Authors:
Stefano Lise,
Maya Paczuski,
Attilio Stella
Abstract:
After a large earthquake, the likelihood of successive strong aftershocks needs to be estimated. Exploiting similarities with critical phenomena, we introduce a scaling law for the decay in time following a main shock of the expected number of aftershocks greater than a certain magnitude. Empirical results that support our scaling hypothesis are obtained from analyzing the record of earthquakes…
▽ More
After a large earthquake, the likelihood of successive strong aftershocks needs to be estimated. Exploiting similarities with critical phenomena, we introduce a scaling law for the decay in time following a main shock of the expected number of aftershocks greater than a certain magnitude. Empirical results that support our scaling hypothesis are obtained from analyzing the record of earthquakes in California. The proposed form unifies the well-known Omori and Gutenberg-Richter laws of seismicity, together with other phenomenological observations. Our results substantially modify presently employed estimates and may lead to an improved assessment of seismic hazard after a large earthquake.}
△ Less
Submitted 1 March, 2004;
originally announced March 2004.
-
A Heavenly Example of Scale Free Networks and Self-Organized Criticality
Authors:
Maya Paczuski,
David Hughes
Abstract:
The sun provides an explosive, heavenly example of self-organized criticality. Sudden bursts of intense radiation emanate from rapid rearrangements of the magnetic field network in the corona. Avalanches are triggered by loops of flux that reconnect or snap into lower energy configurations when they are overly stressed. Our recent analysis of observational data reveals that the loops (links) and…
▽ More
The sun provides an explosive, heavenly example of self-organized criticality. Sudden bursts of intense radiation emanate from rapid rearrangements of the magnetic field network in the corona. Avalanches are triggered by loops of flux that reconnect or snap into lower energy configurations when they are overly stressed. Our recent analysis of observational data reveals that the loops (links) and footpoints (nodes), where they attach on the photosphere, embody a scale free network. The statistics of the avalanches and of the network structure are unified through a simple dynamical model where the avalanches and network co-generate each other into a complex, critical state. This particular example points toward a general dynamical mechanism for self-generation of complex networks.
△ Less
Submitted 13 November, 2003;
originally announced November 2003.
-
Scale free networks of earthquakes and aftershocks
Authors:
Marco Baiesi,
Maya Paczuski
Abstract:
We propose a new metric to quantify the correlation between any two earthquakes. The metric consists of a product involving the time interval and spatial distance between two events, as well as the magnitude of the first one. According to this metric, events typically are strongly correlated to only one or a few preceding ones. Thus a classification of events as foreshocks, main shocks or afters…
▽ More
We propose a new metric to quantify the correlation between any two earthquakes. The metric consists of a product involving the time interval and spatial distance between two events, as well as the magnitude of the first one. According to this metric, events typically are strongly correlated to only one or a few preceding ones. Thus a classification of events as foreshocks, main shocks or aftershocks emerges automatically without imposing predefined space-time windows. To construct a network, each earthquake receives an incoming link from its most correlated predecessor. The number of aftershocks for any event, identified by its outgoing links, is found to be scale free with exponent $γ= 2.0(1)$. The original Omori law with $p=1$ emerges as a robust feature of seismicity, holding up to years even for aftershock sequences initiated by intermediate magnitude events. The measured fat-tailed distribution of distances between earthquakes and their aftershocks suggests that aftershock collection with fixed space windows is not appropriate.
△ Less
Submitted 21 September, 2003;
originally announced September 2003.
-
Scaling in Fracture and Refreezing of Sea Ice
Authors:
R. Korsnes,
S. R. Souza,
R. Donangelo,
R. Donangelo,
M. Paczuski,
K. Sneppen
Abstract:
Sea ice breaks up and regenerates rapidly during winter conditions in the Arctic. Analyzing satellite data from the Kara Sea, we find that the average ice floe size depends on weather conditions. Nevertheless, the frequency of floes of size $A$ is a power law, $N\sim A^{-τ}$, where $τ=1.6\pm 0.2$, for $A$ less than approximately 100 $km^2$. This scale-invariant behaviour suggests a competition b…
▽ More
Sea ice breaks up and regenerates rapidly during winter conditions in the Arctic. Analyzing satellite data from the Kara Sea, we find that the average ice floe size depends on weather conditions. Nevertheless, the frequency of floes of size $A$ is a power law, $N\sim A^{-τ}$, where $τ=1.6\pm 0.2$, for $A$ less than approximately 100 $km^2$. This scale-invariant behaviour suggests a competition between fracture due to strains in the ice field and refreezing of the fractures. A cellular model for this process gives results consistent with observations.
△ Less
Submitted 18 September, 2003;
originally announced September 2003.
-
Scale-Free Magnetic Networks: Comparing Observational Data with a Self-Organizing Model of the Coronal Field
Authors:
David Hughes,
Maya Paczuski
Abstract:
We propose that the coronal magnetic field, linking concentrations on the photosphere through an interwoven web of flux, embodies a scale-free network. It arises from a self-organized critical dynamics including flux emergence, the diffusion and merging of magnetic concentrations, as well as avalanches of reconnecting flux tubes. Magnetic concentrations such as fragments, pores and sunspots, are…
▽ More
We propose that the coronal magnetic field, linking concentrations on the photosphere through an interwoven web of flux, embodies a scale-free network. It arises from a self-organized critical dynamics including flux emergence, the diffusion and merging of magnetic concentrations, as well as avalanches of reconnecting flux tubes. Magnetic concentrations such as fragments, pores and sunspots, are `nodes' joined by flux tubes or `links'. The number of links emanating from a node is scale-free. We reanalyze the quiet-Sun data of Close et al and show that the distribution of magnetic concentration strengths is a power law with an index $γ= 1.7 \pm 0.3$, over the entire range of the measurement, about $(2-500)\times 10^{17}$ Mx. This distribution is compatible with that for the sizes of active regions reported by Harvey and Schwaan. Thus magnetic concentrations may be scale-free from the smallest measurable fragments to the large active regions. Numerical simulations of a self-organized critical model give the same index $γ$, within statistical uncertainty. The exponential distribution of flux tube lengths also agrees quantitatively with results from the model. Calibration with the measured diffusion constant of magnetic concentrations allows us to calculate a flux turnover time in the model to be of order 10 hours and the total solar flux to be of order $10^{23}$Mx, agreeing with observations. We introduce two other statistical quantities to characterize scale-free networks. The probability distribution for the amount of flux connecting a pair of concentrations, and the number of distinct concentrations linked to a given one are predicted to be scale-free, with different indices. Our approach unifies the observation of scale free flare energies with the coronal magnetic field structure.
△ Less
Submitted 8 September, 2003;
originally announced September 2003.
-
Solar Flares as Cascades of Reconnecting Magnetic Loops
Authors:
D. Hughes,
M. Paczuski,
R. O. Dendy,
P. Helander,
K. G. McClements
Abstract:
A model for the solar coronal magnetic field is proposed where multiple directed loops evolve in space and time. Loops injected at small scales are anchored by footpoints of opposite polarity moving randomly on a surface. Nearby footpoints of the same polarity aggregate, and loops can reconnect when they collide. This may trigger a cascade of further reconnection, representing a solar flare. Num…
▽ More
A model for the solar coronal magnetic field is proposed where multiple directed loops evolve in space and time. Loops injected at small scales are anchored by footpoints of opposite polarity moving randomly on a surface. Nearby footpoints of the same polarity aggregate, and loops can reconnect when they collide. This may trigger a cascade of further reconnection, representing a solar flare. Numerical simulations show that a power law distribution of flare energies emerges, associated with a scale free network of loops, indicating self-organized criticality.
△ Less
Submitted 21 February, 2003; v1 submitted 9 October, 2002;
originally announced October 2002.
-
A Nonconservative Earthquake Model of Self-Organized Criticality on a Random Graph
Authors:
Stefano Lise,
Maya Paczuski
Abstract:
We numerically investigate the Olami-Feder-Christensen model on a quenched random graph. Contrary to the case of annealed random neighbors, we find that the quenched model exhibits self-organized criticality deep within the nonconservative regime. The probability distribution for avalanche size obeys finite size scaling, with universal critical exponents. In addition, a power law relation betwee…
▽ More
We numerically investigate the Olami-Feder-Christensen model on a quenched random graph. Contrary to the case of annealed random neighbors, we find that the quenched model exhibits self-organized criticality deep within the nonconservative regime. The probability distribution for avalanche size obeys finite size scaling, with universal critical exponents. In addition, a power law relation between the size and the duration of an avalanche exists. We propose that this may represent the correct mean-field limit of the model rather than the annealed random neighbor version.
△ Less
Submitted 23 April, 2002;
originally announced April 2002.
-
Luminous matter may arise from a turbulent plasma state of the early universe
Authors:
Per Bak,
Maya Paczuski
Abstract:
The almost perfect uniformity of the cosmic microwave background (CMB) radiation, discovered by Penzias and Wilson in 1965 appears to present clearcut evidence that the universe was uniform and in equilibrium at the decoupling transition when a plasma of protons and electrons condensed into a gas of Hydrogen. COBE indicates that only very small ripples of order 10^{-5} existed at decoupling. Gra…
▽ More
The almost perfect uniformity of the cosmic microwave background (CMB) radiation, discovered by Penzias and Wilson in 1965 appears to present clearcut evidence that the universe was uniform and in equilibrium at the decoupling transition when a plasma of protons and electrons condensed into a gas of Hydrogen. COBE indicates that only very small ripples of order 10^{-5} existed at decoupling. Gravity then caused hydrogen to cluster and possibly reheat parts of the universe to form the luminous matter that we observe today. We suggest an alternative scenario, where a spatially intermittent structure of extremely hot matter already existed in an otherwise uniform plasma state at the decoupling transition. The plasma was not in equilibrium but in a very high Reynolds number turbulent state. The sparse bursts would not affect the uniformity of the CMB radiation. Luminous matter originates from localized hot bursts already present in the plasma state prior to decoupling. No reheating, and no exotic matter is needed to get luminous matter.
△ Less
Submitted 3 April, 2002;
originally announced April 2002.