-
Comment on "Dynamic Opinion Model and Invasion Percolation"
Authors:
A. Sattari,
M. Paczuski,
P. Grassberger
Abstract:
In J. Shao et al., PRL 103, 108701 (2009) the authors claim that a model with majority rule coarsening exhibits in d=2 a percolation transition in the universality class of invasion percolation with trapping. In the present comment we give compelling evidence, including high statistics simulations on much larger lattices, that this is not correct, and that the model is trivially in the ordinary percolation universality class.
Submitted 8 August, 2012;
originally announced August 2012.
-
PageRank and rank-reversal dependence on the damping factor
Authors:
Seung-Woo Son,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
PageRank (PR) is an algorithm originally developed by Google to evaluate the importance of web pages. Considering how deeply rooted Google's PR algorithm is to gathering relevant information or to the success of modern businesses, the question of rank-stability and choice of the damping factor (a parameter in the algorithm) is clearly important. We investigate PR as a function of the damping factor d on a network obtained from a domain of the World Wide Web, finding that rank-reversal happens frequently over a broad range of PR (and of d). We use three different correlation measures, Pearson, Spearman, and Kendall, to study rank-reversal as d changes, and show that the correlation of PR vectors drops rapidly as d changes from its frequently cited value, $d_0=0.85$. Rank-reversal is also observed by measuring the Spearman and Kendall rank correlation, which evaluate relative ranks rather than absolute PR. Rank-reversal happens not only in directed networks containing rank-sinks but also in a single strongly connected component, which by definition does not contain any sinks. We relate rank-reversals to rank-pockets and bottlenecks in the directed network structure. For the network studied, the relative rank is more stable by our measures around $d=0.65$ than at $d=d_0$.
Submitted 23 January, 2012;
originally announced January 2012.
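The dependence on the damping factor described in this abstract can be explored on a toy graph. The following is a minimal sketch, not the authors' code: it assumes a plain power-iteration PageRank with uniform teleportation and uniform redistribution of rank from dangling nodes, and the names `pagerank`, `discordant_pairs`, and the four-node graph are hypothetical. Counting discordant pairs is the ingredient behind Kendall's rank correlation, one of the three measures the abstract mentions.

```python
from itertools import combinations


def pagerank(out_links, d=0.85, tol=1e-12, max_iter=1000):
    """Power-iteration PageRank on a dict {node: [successors]}.

    Assumption: every successor also appears as a key of out_links.
    """
    nodes = sorted(out_links)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(max_iter):
        # Rank held by dangling nodes (no out-links) is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not out_links[v])
        new = {v: (1.0 - d) / n + d * dangling / n for v in nodes}
        for v in nodes:
            for w in out_links[v]:
                new[w] += d * rank[v] / len(out_links[v])
        # Stop once the L1 change between iterations is below tol.
        if sum(abs(new[v] - rank[v]) for v in nodes) < tol:
            return new
        rank = new
    return rank


def discordant_pairs(r1, r2):
    """Count node pairs whose relative order differs between two PR vectors."""
    return sum(
        1
        for a, b in combinations(sorted(r1), 2)
        if (r1[a] - r1[b]) * (r2[a] - r2[b]) < 0
    )


if __name__ == "__main__":
    # Hypothetical toy graph; "d" is a dangling node.
    graph = {"a": ["b"], "b": ["c"], "c": ["a", "b"], "d": []}
    pr_085 = pagerank(graph, d=0.85)
    pr_065 = pagerank(graph, d=0.65)
    print("rank reversals between d=0.85 and d=0.65:",
          discordant_pairs(pr_085, pr_065))
```

On a real web graph one would sweep d over a range and track the discordant-pair count (or a full Kendall or Spearman coefficient) against the reference value d_0 = 0.85.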
-
Sampling properties of directed networks
Authors:
Seung-Woo Son,
Claire Christensen,
Golnoosh Bizhani,
David V. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
For many real-world networks only a small "sampled" version of the original network may be investigated; those results are then used to draw conclusions about the actual system. Variants of breadth-first search (BFS) sampling, which are based on epidemic processes, are widely used. Although it is well established that BFS sampling fails, in most cases, to capture the IN-component(s) of directed networks, a description of the effects of BFS sampling on other topological properties is all but absent from the literature. To systematically study the effects of sampling biases on directed networks, we compare BFS sampling to random sampling on complete large-scale directed networks. We present new results and a thorough analysis of the topological properties of seven different complete directed networks (prior to sampling), including three versions of Wikipedia, three different sources of sampled World Wide Web data, and an Internet-based social network. We detail the differences that sampling method and coverage can make to the structural properties of sampled versions of these seven networks. Most notably, we find that sampling method and coverage affect both the bow-tie structure and the number and structure of strongly connected components in sampled networks. In addition, at low sampling coverage (i.e. less than 40%), the values of average degree, variance of out-degree, degree auto-correlation, and link reciprocity are overestimated by 30% or more in BFS-sampled networks, and only attain values within 10% of the corresponding values in the complete networks when sampling coverage is in excess of 65%. These results may cause us to rethink what we know about the structure, function, and evolution of real-world directed networks.
Submitted 13 October, 2012; v1 submitted 6 January, 2012;
originally announced January 2012.
-
Random Sequential Renormalization and Agglomerative Percolation in Networks: Application to Erdős-Rényi and Scale-free Graphs
Authors:
Golnoosh Bizhani,
Peter Grassberger,
Maya Paczuski
Abstract:
We study the statistical behavior under random sequential renormalization (RSR) of several network models including Erdős-Rényi (ER) graphs, scale-free networks and an annealed model (AM) related to ER graphs. In RSR the network is locally coarse grained by choosing at each renormalization step a node at random and joining it to all its neighbors. Compared to previous (quasi-)parallel renormalization methods [C. Song et al.], RSR allows a more fine-grained analysis of the renormalization group (RG) flow, and unravels new features that were not discussed in the previous analyses. In particular we find that all networks exhibit a second order transition in their RG flow. This phase transition is associated with the emergence of a giant hub and can be viewed as a new variant of percolation, called agglomerative percolation. We claim that this transition exists also in previous graph renormalization schemes and explains some of the scaling laws seen there. For critical trees it happens as N/N0 -> 0 in the limit of large systems (where N0 is the initial size of the graph and N its size at a given RSR step). In contrast, it happens at finite N/N0 in sparse ER graphs and in the annealed model, while it happens for N/N0 -> 1 on scale-free networks. Critical exponents seem to depend on the type of the graph but not on the average degree and obey usual scaling relations for percolation phenomena. For the annealed model they agree with the exponents obtained from a mean-field theory. At late times, the networks exhibit a star-like structure in agreement with the results of Radicchi et al. While degree distributions are of main interest when regarding the scheme as network renormalization, mass distributions (which are more relevant when considering 'supernodes' as clusters) are much easier to study using the fast Newman-Ziff algorithm for percolation, allowing us to obtain very high statistics.
Submitted 12 December, 2011; v1 submitted 21 September, 2011;
originally announced September 2011.
-
Percolation Theory on Interdependent Networks Based on Epidemic Spreading
Authors:
Seung-Woo Son,
Golnoosh Bizhani,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
We consider percolation on interdependent locally treelike networks, recently introduced by Buldyrev et al., Nature 464, 1025 (2010), and demonstrate that the problem can be simplified conceptually by deleting all references to cascades of failures. Such cascades do exist, but their explicit treatment just complicates the theory -- which is a straightforward extension of the usual epidemic spreading theory on a single network. Our method has the added benefits that it is directly formulated in terms of an order parameter and its modular structure can be easily extended to other problems, e.g. to any number of interdependent networks, or to networks with dependency links.
Submitted 20 September, 2011;
originally announced September 2011.
-
Exact solutions for mass-dependent irreversible aggregations
Authors:
Seung-Woo Son,
Claire Christensen,
Golnoosh Bizhani,
Peter Grassberger,
Maya Paczuski
Abstract:
We consider the mass-dependent aggregation process (k+1)X -> X, given a fixed number of unit mass particles in the initial state. One cluster is chosen proportional to its mass and is merged into one either with k-neighbors in one dimension, or -- in the well-mixed case -- with k other clusters picked randomly. We find the same combinatorial exact solutions for the probability to find any given configuration of particles on a ring or line, and in the well-mixed case. The mass distribution of a single cluster exhibits scaling laws and the finite size scaling form is given. The relation to the classical sum kernel of irreversible aggregation is discussed.
Submitted 31 August, 2011;
originally announced September 2011.
-
Are Percolation Transitions always Sharpened by Making Networks Interdependent?
Authors:
Seung-Woo Son,
Peter Grassberger,
Maya Paczuski
Abstract:
We study a model for coupled networks introduced recently by Buldyrev et al., Nature 464, 1025 (2010), where each node has to be connected to others via two types of links to be viable. Removing a critical fraction of nodes leads to a percolation transition that has been claimed to be more abrupt than that for uncoupled networks. Indeed, it was found to be discontinuous in all cases studied. Using an efficient new algorithm we verify that the transition is discontinuous for coupled Erdos-Renyi networks, but find it to be continuous for fully interdependent diluted lattices. In 2 and 3 dimensions, the order parameter exponent $β$ is larger than in ordinary percolation, showing that the transition is less sharp, i.e. further from discontinuity, than for isolated networks. Possible consequences for spatially embedded networks are discussed.
Submitted 19 September, 2011; v1 submitted 18 August, 2011;
originally announced August 2011.
-
Clustering Drives Assortativity and Community Structure in Ensembles of Networks
Authors:
David V. Foster,
Jacob G. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
Clustering, assortativity, and communities are key features of complex networks. We probe dependencies between these attributes and find that ensembles with strong clustering display both high assortativity by degree and prominent community structure, while ensembles with high assortativity are much less biased towards clustering or community structure. Further, clustered networks can amplify small homophilic bias for trait assortativity. This marked asymmetry suggests that transitivity, rather than homophily, drives the standard nonsocial/social network dichotomy.
Submitted 5 January, 2011; v1 submitted 10 December, 2010;
originally announced December 2010.
-
Irreversible Aggregation and Network Renormalization
Authors:
Seung-Woo Son,
Golnoosh Bizhani,
Claire Christensen,
Peter Grassberger,
Maya Paczuski
Abstract:
Irreversible aggregation is revisited in view of recent work on renormalization of complex networks. Its scaling laws and phase transitions are related to percolation transitions seen in the latter. We illustrate our points by giving the complete solution for the probability to find any given state in an aggregation process $(k+1)X\to X$, given a fixed number of unit mass particles in the initial state. Exactly the same probability distributions and scaling are found in one dimensional systems (a trivial network) and well-mixed solutions. This reveals that scaling laws found in renormalization of complex networks do not prove that they are self-similar.
Submitted 3 March, 2011; v1 submitted 3 November, 2010;
originally announced November 2010.
-
Random Sequential Renormalization of Networks I: Application to Critical Trees
Authors:
Golnoosh Bizhani,
Vishal Sood,
Maya Paczuski,
Peter Grassberger
Abstract:
We introduce the concept of Random Sequential Renormalization (RSR) for arbitrary networks. RSR is a graph renormalization procedure that locally aggregates nodes to produce a coarse grained network. It is analogous to the (quasi-)parallel renormalization schemes introduced by C. Song et al. (Nature 433, 392 (2005)) and studied more recently by F. Radicchi et al. (Phys. Rev. Lett. 101, 148701 (2008)), but much simpler and easier to implement. In this first paper we apply RSR to critical trees and derive analytical results consistent with numerical simulations. Critical trees exhibit three regimes in their evolution under RSR: (i) An initial regime $N_0^ν\lesssim N<N_0$, where $N$ is the number of nodes at some step in the renormalization and $N_0$ is the initial size. RSR in this regime is described by a mean field theory and fluctuations from one realization to another are small. The exponent $ν=1/2$ is derived using random walk arguments. The degree distribution becomes broader under successive renormalization -- reaching a power law, $p_k\sim 1/k^γ$ with $γ=2$ and a variance that diverges as $N_0^{1/2}$ at the end of this regime. Both of these results are derived based on a scaling theory. (ii) An intermediate regime for $N_0^{1/4}\lesssim N \lesssim N_0^{1/2}$, in which hubs develop, and fluctuations between different realizations of the RSR are large. Crossover functions exhibiting finite size scaling, in the critical region $N\sim N_0^{1/2} \to \infty$, connect the behaviors in the first two regimes. (iii) The last regime, for $1 \ll N\lesssim N_0^{1/4}$, is characterized by the appearance of star configurations with a central hub surrounded by many leaves. The distribution of sizes where stars first form is found numerically to be a power law up to a cutoff that scales as $N_0^{ν_{star}}$ with $ν_{star}\approx 1/4$.
Submitted 23 March, 2011; v1 submitted 20 September, 2010;
originally announced September 2010.
-
The Interacting Branching Process as a Simple Model of Innovation
Authors:
Vishal Sood,
Myléne Mathieu,
Amer Shreim,
Peter Grassberger,
Maya Paczuski
Abstract:
We describe innovation in terms of a generalized branching process. Each new invention pairs with any existing one to produce a number of offspring, which is Poisson distributed with mean p. Existing inventions die with probability p/τ at each generation. In contrast to mean field results, no phase transition occurs; the chance for survival is finite for all p > 0. For τ = \infty, surviving processes exhibit a bottleneck before exploding super-exponentially - a growth consistent with a law of accelerating returns. This behavior persists for finite τ. We analyze, in detail, the asymptotic behavior as p \to 0.
Submitted 17 September, 2010; v1 submitted 30 March, 2010;
originally announced March 2010.
-
Attractor and Basin Entropies of Random Boolean Networks Under Asynchronous Stochastic Update
Authors:
Amer Shreim,
Andrew Berdahl,
Florian Greil,
Jörn Davidsen,
Maya Paczuski
Abstract:
We introduce a numerical method to study random Boolean networks with asynchronous stochastic update. Each node in the network of states starts with equal occupation probability and this probability distribution then evolves to a steady state. Nodes left with finite occupation probability determine the attractors and the sizes of their basins. As for synchronous update, the basin entropy grows with system size only for critical networks, where the distribution of attractor lengths is a power law. We determine analytically the distribution for the number of attractors and basin sizes for frozen networks with connectivity K = 1.
Submitted 10 March, 2010;
originally announced March 2010.
-
Clustering Phase Transitions and Hysteresis: Pitfalls in Constructing Network Ensembles
Authors:
David V. Foster,
Jacob G. Foster,
Maya Paczuski,
Peter Grassberger
Abstract:
Ensembles of networks are used as null models in many applications. However, simple null models often show much less clustering than their real-world counterparts. In this paper, we study a model where clustering is enhanced by means of a fugacity term as in the Strauss (or "triangle") model, but where the degree sequence is strictly preserved -- thus maintaining the quenched heterogeneity of nodes found in the original degree sequence. Similar models had been proposed previously in [R. Milo et al., Science 298, 824 (2002)]. We find that our model exhibits phase transitions as the fugacity is changed. For regular graphs (identical degrees for all nodes) with degree k > 2 we find a single first order transition. For all non-regular networks that we studied (including Erdos - Renyi and scale-free networks) we find multiple jumps resembling first order transitions, together with strong hysteresis. The latter transitions are driven by the sudden emergence of "cluster cores": groups of highly interconnected nodes with higher than average degrees. To study these cluster cores visually, we introduce q-clique adjacency plots. We find that these cluster cores constitute distinct communities which emerge spontaneously from the triangle generating process. Finally, we point out that cluster cores produce pitfalls when using the present (and similar) models as null models for strongly clustered networks, due to the very strong hysteresis which effectively leads to broken ergodicity on realistic time scales.
Submitted 11 November, 2009;
originally announced November 2009.
-
Activity Dependent Branching Ratios in Stocks, Solar X-ray Flux, and the Bak-Tang-Wiesenfeld Sandpile Model
Authors:
Elliot Martin,
Amer Shreim,
Maya Paczuski
Abstract:
We define an activity dependent branching ratio that allows comparison of different time series $X_{t}$. The branching ratio $b_x$ is defined as $b_x= E[ξ_x/x]$. The random variable $ξ_x$ is the value of the next signal given that the previous one is equal to $x$, so $ξ_x=\{X_{t+1}|X_t=x\}$. If $b_x>1$, the process is on average supercritical when the signal is equal to $x$, while if $b_x<1$, it is subcritical. For stock prices we find $b_x=1$ within statistical uncertainty, for all $x$, consistent with an ``efficient market hypothesis''. For stock volumes, solar X-ray flux intensities, and the Bak-Tang-Wiesenfeld (BTW) sandpile model, $b_x$ is supercritical for small values of activity and subcritical for the largest ones, indicating a tendency to return to a typical value. For stock volumes this tendency has an approximate power law behavior. For solar X-ray flux and the BTW model, there is a broad regime of activity where $b_x \simeq 1$, which we interpret as an indicator of critical behavior. This is true despite different underlying probability distributions for $X_t$, and for $ξ_x$. For the BTW model the distribution of $ξ_x$ is Gaussian, for $x$ sufficiently larger than one, and its variance grows linearly with $x$. Hence, the activity in the BTW model obeys a central limit theorem when sampling over past histories. The broad region of activity where $b_x$ is close to one disappears once bulk dissipation is introduced in the BTW model -- supporting our hypothesis that it is an indicator of criticality.
Submitted 13 October, 2009;
originally announced October 2009.
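The definition $b_x = E[ξ_x/x]$ with $ξ_x=\{X_{t+1}|X_t=x\}$ in this abstract translates directly into an empirical estimator. The sketch below is illustrative only and is not the authors' code: it assumes a discrete-valued signal so that conditioning on $X_t = x$ can be done by exact matching (a continuous signal would need binning), and the function name `branching_ratio` is hypothetical.

```python
from collections import defaultdict


def branching_ratio(series):
    """Estimate b_x = E[X_{t+1} / x | X_t = x] for every observed x > 0.

    Returns a dict mapping each activity level x to the sample mean of
    X_{t+1} / x taken over all times t with X_t == x.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for x, nxt in zip(series, series[1:]):
        if x > 0:  # the ratio is undefined at zero activity
            sums[x] += nxt / x
            counts[x] += 1
    return {x: sums[x] / counts[x] for x in sums}


if __name__ == "__main__":
    signal = [1, 2, 1, 2, 4]
    for x, b in sorted(branching_ratio(signal).items()):
        regime = "supercritical" if b > 1 else "subcritical" if b < 1 else "critical"
        print(f"x={x}: b_x={b:.3f} ({regime})")
```

With real data, each estimate should come with a statistical uncertainty (for example the standard error over the samples at each x) before it is compared with the critical value 1.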
-
Edge direction and the structure of networks
Authors:
Jacob G. Foster,
David V. Foster,
Peter Grassberger,
Maya Paczuski
Abstract:
Directed networks are ubiquitous and are necessary to represent complex systems with asymmetric interactions---from food webs to the World Wide Web. Despite the importance of edge direction for detecting local and community structure, it has been disregarded in studying a basic type of global diversity in networks: the tendency of nodes with similar numbers of edges to connect. This tendency, called assortativity, affects crucial structural and dynamic properties of real-world networks, such as error tolerance or epidemic spreading. Here we demonstrate that edge direction has profound effects on assortativity. We define a set of four directed assortativity measures and assign statistical significance by comparison to randomized networks. We apply these measures to three network classes---online/social networks, food webs, and word-adjacency networks. Our measures (i) reveal patterns common to each class, (ii) separate networks that have been previously classified together, and (iii) expose limitations of several existing theoretical models. We reject the standard classification of directed networks as purely assortative or disassortative. Many display a class-specific mixture, likely reflecting functional or historical constraints, contingencies, and forces guiding the system's evolution.
Submitted 7 November, 2010; v1 submitted 28 August, 2009;
originally announced August 2009.
-
Reply to Comment on ``Analysis of the spatial distribution between successive earthquakes''
Authors:
J. Davidsen,
M. Paczuski
Abstract:
This is a reply to the Comment on ``Analysis of the spatial distribution between successive earthquakes'' by Maximilian Jonas Werner and Didier Sornette.
Submitted 6 May, 2008;
originally announced May 2008.
-
Avalanches, branching ratios, and clustering of attractors in Random Boolean Networks and in the segment polarity network of \emph{Drosophila}
Authors:
Andrew Berdahl,
Amer Shreim,
Vishal Sood,
Joern Davidsen,
Maya Paczuski
Abstract:
We discuss basic features of emergent complexity in dynamical systems far from equilibrium by focusing on the network structure of their state space. We start by measuring the distributions of avalanche and transient times in Random Boolean Networks (RBNs) and in the \emph{Drosophila} polarity network by exact enumeration. A transient time is the duration of the transient from a starting state to an attractor. An avalanche is a special transient which starts as a single Boolean element perturbation of an attractor state. Significant differences at short times between the avalanche and the transient times for RBNs with small connectivity $K$ -- compared to the number of elements $N$ -- indicate that attractors tend to cluster in configuration space. In addition, one bit flip has a non-negligible chance to put an attractor state directly onto another attractor. This clustering is also present in the segment polarity gene network of \emph{Drosophila melanogaster}, suggesting that this may be a robust feature of biological regulatory networks. We also define and measure a branching ratio for the state space networks and find evidence for a new time scale that diverges roughly linearly with $N$ for $2\leq K \ll N$. Analytic arguments show that this time scale does not appear in the random map nor can the random map exhibit clustering of attractors. We further show that for $K=2$ the branching ratio exhibits the largest variation with distance from the attractor compared to other values of $K$ and that the avalanche durations exhibit no characteristic scale within our statistical resolution. Hence, we propose that the branching ratio and the avalanche duration are new indicators for scale-free behavior that may or may not be found simultaneously with other indicators of emergent complexity in extended, deterministic dynamical systems.
Submitted 2 May, 2008;
originally announced May 2008.
-
Graph animals, subgraph sampling and motif search in large networks
Authors:
Kim Baskerville,
Peter Grassberger,
Maya Paczuski
Abstract:
We generalize a sampling algorithm for lattice animals (connected clusters on a regular lattice) to a Monte Carlo algorithm for `graph animals', i.e. connected subgraphs in arbitrary networks. As with the algorithm in [N. Kashtan et al., Bioinformatics 20, 1746 (2004)], it provides a weighted sample, but the computation of the weights is much faster (linear in the size of subgraphs, instead of super-exponential). This allows subgraphs with up to ten or more nodes to be sampled with very high statistics, from arbitrarily large networks. Using this together with a heuristic algorithm for rapidly classifying isomorphic graphs, we present results for two protein interaction networks obtained using the TAP high throughput method: one of Escherichia coli with 230 nodes and 695 links, and one for yeast (Saccharomyces cerevisiae) with roughly ten times more nodes and links. We find in both cases that most connected subgraphs are strong motifs (Z-scores >10) or anti-motifs (Z-scores <-10) when the null model is the ensemble of networks with fixed degree sequence. Strong differences appear between the two networks, with dominant motifs in E. coli being (nearly) bipartite graphs and having many pairs of nodes which connect to the same neighbors, while dominant motifs in yeast tend towards completeness or contain large cliques. We also explore a number of methods that do not rely on measurements of Z-scores or comparisons with null models. For instance, we discuss the influence of specific complexes like the 26S proteasome in yeast, where a small number of complexes dominate the $k$-cores with large k and have a decisive effect on the strongest motifs with 6 to 8 nodes. We also present Zipf plots of counts versus rank. They show broad distributions that are not power laws, in contrast to the case when disconnected subgraphs are included.
Submitted 22 June, 2007; v1 submitted 13 February, 2007;
originally announced February 2007.
-
Networks of Recurrent Events, a Theory of Records, and an Application to Finding Causal Signatures in Seismicity
Authors:
J. Davidsen,
P. Grassberger,
M. Paczuski
Abstract:
We propose a method to search for signs of causal structure in spatiotemporal data making minimal a priori assumptions about the underlying dynamics. To this end, we generalize the elementary concept of recurrence for a point process in time to recurrent events in space and time. An event is defined to be a recurrence of any previous event if it is closer to it in space than all the intervening events. As such, each sequence of recurrences for a given event is a record breaking process. This definition provides a strictly data driven technique to search for structure. Defining events to be nodes, and linking each event to its recurrences, generates a network of recurrent events. Significant deviations in properties of that network compared to networks arising from random processes allow one to infer attributes of the causal dynamics that generate observable correlations in the patterns. We derive analytically a number of properties for the network of recurrent events composed by a random process. We extend the theory of records to treat not only the variable where records happen, but also time as continuous. In this way, we construct a fully symmetric theory of records leading to a number of new results. Those analytic results are compared to the properties of a network synthesized from earthquakes in Southern California. Significant disparities from the ensemble of acausal networks that can be plausibly attributed to the causal structure of seismicity are: (1) Invariance of network statistics with the time span of the events considered, (2) Appearance of a fundamental length scale for recurrences, independent of the time span of the catalog, which is consistent with observations of the ``rupture length'', (3) Hierarchy in the distances and times of subsequent recurrences.
Submitted 29 April, 2008; v1 submitted 16 January, 2007;
originally announced January 2007.
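The recurrence definition in this abstract (an event is a recurrence of an earlier event if it lies closer to it in space than every intervening event) can be illustrated with a minimal sketch. The function name `recurrence_network`, the `(t, x, y)` event layout, and the use of planar Euclidean distance are assumptions made for illustration; they are not taken from the paper.

```python
import math


def recurrence_network(events):
    """Build directed edges (i, j) where event j is a recurrence of event i.

    events: list of (t, x, y) tuples, assumed already sorted by time t.
    Planar Euclidean distance is an illustrative assumption.
    """
    edges = []
    for i, (_, xi, yi) in enumerate(events):
        record = math.inf  # smallest distance to event i seen so far
        for j in range(i + 1, len(events)):
            _, xj, yj = events[j]
            d = math.hypot(xj - xi, yj - yi)
            if d < record:  # closer than all intervening events: a new record
                record = d
                edges.append((i, j))
    return edges


if __name__ == "__main__":
    toy = [(0, 0, 0), (1, 5, 0), (2, 3, 0), (3, 4, 0), (4, 1, 0)]
    print(recurrence_network(toy))
```

Each inner loop tracks a running minimum, so the recurrences of a given event form a record breaking sequence, as the abstract states. The double loop is quadratic in the number of events, which is acceptable for a sketch but would need care for a full catalog.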
-
Earthquake recurrence as a record breaking process
Authors:
Joern Davidsen,
Peter Grassberger,
Maya Paczuski
Abstract:
Extending the central concept of recurrence times for a point process to recurrent events in space-time allows us to characterize seismicity as a record breaking process using only spatiotemporal relations among events. Linking record breaking events with edges between nodes in a graph generates a complex dynamical network isolated from any length, time or magnitude scales set by the observer. For Southern California, the network of recurrences reveals new statistical features of seismicity with robust scaling laws. The rupture length and its scaling with magnitude emerges as a generic measure for distance between recurrent events. Further, the relative separations for subsequent records in space (or time) form a hierarchy with unexpected scaling properties.
Submitted 28 June, 2006; v1 submitted 11 July, 2005;
originally announced July 2005.
-
Networks as Renormalized Models for Emergent Behavior in Physical Systems
Authors:
Maya Paczuski
Abstract:
Networks are paradigms for describing complex biological, social and technological systems. Here I argue that networks provide a coherent framework to construct coarse-grained models for many different physical systems. To elucidate these ideas, I discuss two long-standing problems. The first concerns the structure and dynamics of magnetic fields in the solar corona, as exemplified by sunspots that startled Galileo almost 400 years ago. We discovered that the magnetic structure of the corona embodies a scale free network, with spots at all scales. A network model representing the three-dimensional geometry of magnetic fields, where links rewire and nodes merge when they collide in space, gives quantitative agreement with available data, and suggests new measurements. Seismicity is addressed in terms of relations between events without imposing space-time windows. A metric estimates the correlation between any two earthquakes. Linking strongly correlated pairs and ignoring pairs with weak correlation organizes the spatio-temporal process into a sparse, directed, weighted network. New scaling laws for seismicity are found. For instance, the aftershock decay rate decreases as 1/t in time up to a correlation time, $t_{\rm omori}$. An estimate from the data gives $t_{\rm omori}$ to be about one year for small magnitude 3 earthquakes, about 1400 years for the Landers event, and roughly 26,000 years for the earthquake causing the 2004 Asian tsunami. Our results confirm Kagan's conjecture that aftershocks can rumble on for centuries.
Submitted 7 February, 2005;
originally announced February 2005.
-
How far away is the next earthquake?
Authors:
Jörn Davidsen,
Maya Paczuski
Abstract:
Spatial distances between subsequent earthquakes in southern California exhibit scale-free statistics, with a critical exponent $δ\approx 0.6$, as well as finite size scaling. The statistics are independent of the threshold magnitude as long as the catalog is complete, but depend strongly on the temporal ordering of events, rather than the geometry of the spatial epicenter distribution. Nevertheless, the spatial distance and waiting time between subsequent earthquakes are uncorrelated with each other. These observations contradict the theory of aftershock zone scaling with main shock magnitude.
Submitted 11 November, 2004;
originally announced November 2004.
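The quantity studied in this abstract, the spatial distance between temporally consecutive earthquakes above a threshold magnitude, could be extracted from a catalog along the following lines. This is only a sketch: the function names, the `(t, lat, lon, m)` record layout, and the haversine great-circle distance with a mean Earth radius of 6371 km are assumptions for illustration, not the authors' procedure.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an illustrative choice


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def successive_distances(catalog, m_min):
    """Distances between consecutive events with magnitude >= m_min.

    catalog: list of (t, lat, lon, m) tuples (hypothetical layout).
    """
    kept = sorted((e for e in catalog if e[3] >= m_min), key=lambda e: e[0])
    return [haversine_km(a[1], a[2], b[1], b[2]) for a, b in zip(kept, kept[1:])]


if __name__ == "__main__":
    toy = [(0, 0.0, 0.0, 3.0), (1, 10.0, 10.0, 1.0), (2, 0.0, 1.0, 3.5)]
    print(successive_distances(toy, m_min=2.0))
```

Raising `m_min` drops events from the sequence, so distances are recomputed between the remaining neighbors in time. Comparing the resulting histograms across thresholds is one way to probe the threshold independence claimed in the abstract.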
-
Complex networks of earthquakes and aftershocks
Authors:
Marco Baiesi,
Maya Paczuski
Abstract:
We invoke a metric to quantify the correlation between any two earthquakes. This provides a simple and straightforward alternative to using space-time windows to detect aftershock sequences and obviates the need to distinguish main shocks from aftershocks. Directed networks of earthquakes are constructed by placing a link, directed from the past to the future, between pairs of events that are strongly correlated. Each link has a weight giving the relative strength of correlation such that the sum over the incoming links to any node equals unity for aftershocks, or zero if the event had no correlated predecessors. A correlation threshold is set to drastically reduce the size of the data set without losing significant information. Events can be aftershocks of many previous events, and also generate many aftershocks. The probability distributions for the numbers of incoming and outgoing links are both scale free, and the networks are highly clustered. The Omori law holds for aftershock rates up to a decorrelation time that scales with the magnitude, $m$, of the initiating shock as $t_{\rm cutoff} \sim 10^{βm}$ with $β\simeq 3/4$. Another scaling law relates distances between earthquakes and their aftershocks to the magnitude of the initiating shock. Our results are inconsistent with the hypothesis of finite aftershock zones. We also find evidence that seismicity is dominantly triggered by small earthquakes. Our approach, using concepts from the modern theory of complex networks, together with a metric to estimate correlations, opens up new avenues of research, as well as new tools to understand seismicity.
Submitted 21 December, 2004; v1 submitted 4 August, 2004;
originally announced August 2004.
-
Scaling law for seismic hazard after a main shock
Authors:
Stefano Lise,
Maya Paczuski,
Attilio Stella
Abstract:
After a large earthquake, the likelihood of successive strong aftershocks needs to be estimated. Exploiting similarities with critical phenomena, we introduce a scaling law for the decay in time following a main shock of the expected number of aftershocks greater than a certain magnitude. Empirical results that support our scaling hypothesis are obtained from analyzing the record of earthquakes in California. The proposed form unifies the well-known Omori and Gutenberg-Richter laws of seismicity, together with other phenomenological observations. Our results substantially modify presently employed estimates and may lead to an improved assessment of seismic hazard after a large earthquake.
Submitted 1 March, 2004;
originally announced March 2004.
-
Scale free networks of earthquakes and aftershocks
Authors:
Marco Baiesi,
Maya Paczuski
Abstract:
We propose a new metric to quantify the correlation between any two earthquakes. The metric consists of a product involving the time interval and spatial distance between two events, as well as the magnitude of the first one. According to this metric, events typically are strongly correlated to only one or a few preceding ones. Thus a classification of events as foreshocks, main shocks or aftershocks emerges automatically without imposing predefined space-time windows. To construct a network, each earthquake receives an incoming link from its most correlated predecessor. The number of aftershocks for any event, identified by its outgoing links, is found to be scale free with exponent $γ= 2.0(1)$. The original Omori law with $p=1$ emerges as a robust feature of seismicity, holding up to years even for aftershock sequences initiated by intermediate magnitude events. The measured fat-tailed distribution of distances between earthquakes and their aftershocks suggests that aftershock collection with fixed space windows is not appropriate.
Submitted 21 September, 2003;
originally announced September 2003.
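The network construction described in this abstract, where each earthquake receives one incoming link from its most correlated predecessor, might be sketched as follows. The abstract states only that the metric is a product involving the time interval, the spatial distance, and the magnitude of the first event. The specific form used here, `n = dt * r**d_f * 10**(-b * m_i)` with a smaller value meaning a stronger correlation, and the default values of `d_f` and `b`, are assumptions for illustration, as are the function name and the `(t, x, y, m)` event layout.

```python
import math


def most_correlated_predecessor(events, d_f=1.6, b=1.0):
    """Return parents[j] = index of the predecessor of event j with the
    smallest metric value, or None if no valid predecessor exists.

    events: list of (t, x, y, m) tuples sorted by time (hypothetical layout).
    Assumed metric: n_ij = dt * r**d_f * 10**(-b * m_i); d_f and b are
    illustrative parameters, not values confirmed by the abstract.
    """
    parents = [None] * len(events)
    for j in range(1, len(events)):
        tj, xj, yj, _ = events[j]
        best, best_n = None, math.inf
        for i in range(j):
            ti, xi, yi, mi = events[i]
            dt = tj - ti
            r = math.hypot(xj - xi, yj - yi)
            if dt <= 0 or r <= 0:
                continue  # skip degenerate pairs (same time or same place)
            n = dt * r ** d_f * 10 ** (-b * mi)
            if n < best_n:
                best, best_n = i, n
        parents[j] = best
    return parents


if __name__ == "__main__":
    toy = [(0.0, 0.0, 0.0, 6.0), (1.0, 1.0, 0.0, 2.0), (2.0, 1.5, 0.0, 2.0)]
    print(most_correlated_predecessor(toy))
```

Counting how often each index appears in the returned list gives the number of aftershocks attributed to that event, which is the out-degree whose distribution the abstract reports as scale free. In the toy input the large first event captures both later events even though the third event is closer in space and time to the second.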
-
Scale-Free Magnetic Networks: Comparing Observational Data with a Self-Organizing Model of the Coronal Field
Authors:
David Hughes,
Maya Paczuski
Abstract:
We propose that the coronal magnetic field, linking concentrations on the photosphere through an interwoven web of flux, embodies a scale-free network. It arises from a self-organized critical dynamics including flux emergence, the diffusion and merging of magnetic concentrations, as well as avalanches of reconnecting flux tubes. Magnetic concentrations such as fragments, pores and sunspots, are `nodes' joined by flux tubes or `links'. The number of links emanating from a node is scale-free. We reanalyze the quiet-Sun data of Close et al. and show that the distribution of magnetic concentration strengths is a power law with an index $γ= 1.7 \pm 0.3$, over the entire range of the measurement, about $(2-500)\times 10^{17}$ Mx. This distribution is compatible with that for the sizes of active regions reported by Harvey and Schwaan. Thus magnetic concentrations may be scale-free from the smallest measurable fragments to the large active regions. Numerical simulations of a self-organized critical model give the same index $γ$, within statistical uncertainty. The exponential distribution of flux tube lengths also agrees quantitatively with results from the model. Calibration with the measured diffusion constant of magnetic concentrations allows us to calculate a flux turnover time in the model to be of order 10 hours and the total solar flux to be of order $10^{23}$ Mx, agreeing with observations. We introduce two other statistical quantities to characterize scale-free networks. The probability distribution for the amount of flux connecting a pair of concentrations, and the number of distinct concentrations linked to a given one are predicted to be scale-free, with different indices. Our approach unifies the observation of scale free flare energies with the coronal magnetic field structure.
Submitted 8 September, 2003;
originally announced September 2003.
-
Solar Flares as Cascades of Reconnecting Magnetic Loops
Authors:
D. Hughes,
M. Paczuski,
R. O. Dendy,
P. Helander,
K. G. McClements
Abstract:
A model for the solar coronal magnetic field is proposed where multiple directed loops evolve in space and time. Loops injected at small scales are anchored by footpoints of opposite polarity moving randomly on a surface. Nearby footpoints of the same polarity aggregate, and loops can reconnect when they collide. This may trigger a cascade of further reconnection, representing a solar flare. Numerical simulations show that a power law distribution of flare energies emerges, associated with a scale free network of loops, indicating self-organized criticality.
Submitted 21 February, 2003; v1 submitted 9 October, 2002;
originally announced October 2002.