-
Topology reveals universal features for network comparison
Authors:
Pierre-André G. Maugis,
Sofia C. Olhede,
Patrick J. Wolfe
Abstract:
The topology of any complex system is key to understanding its structure and function. Fundamentally, algebraic topology guarantees that any system represented by a network can be understood through its closed paths. The length of each path provides a notion of scale, which is vitally important in characterizing dominant modes of system behavior. Here, by combining topology with scale, we prove the existence of universal features which reveal the dominant scales of any network. We use these features to compare several canonical network types in the context of a social media discussion which evolves through the sharing of rumors, leaks and other news. Our analysis enables for the first time a universal understanding of the balance between loops and tree-like structure across network scales, and an assessment of how this balance interacts with the spreading of information online. Crucially, our results allow networks to be quantified and compared in a purely model-free way that is theoretically sound, fully automated, and inherently scalable.
Submitted 16 May, 2017;
originally announced May 2017.
-
Statistical inference for network samples using subgraph counts
Authors:
P-A. G. Maugis,
Carey E. Priebe,
S. C. Olhede,
P. J. Wolfe
Abstract:
We consider that a network is an observation, and a collection of observed networks forms a sample. In this setting, we provide methods to test whether all observations in a network sample are drawn from a specified model. We achieve this by deriving, under the null of the graphon model, the joint asymptotic properties of average subgraph counts as the number of observed networks increases but the number of nodes in each network remains finite. In doing so, we do not require that each observed network contains the same number of nodes, or is drawn from the same distribution. Our results yield joint confidence regions for subgraph counts, and therefore methods for testing whether the observations in a network sample are drawn from a specified distribution, from a specified model, or from the same model as another network sample. We present simulation experiments and an illustrative example on a sample of brain networks, where we find that highly creative individuals' brains present significantly more short cycles.
Submitted 28 May, 2019; v1 submitted 2 January, 2017;
originally announced January 2017.
-
Event Conditional Correlation: Or How Non-Linear Linear Dependence Can Be
Authors:
P-A. G. Maugis
Abstract:
Entries of datasets are often collected only if an event occurred: taking a survey, enrolling in an experiment and so forth. However, such partial samples bias classical correlation estimators. Here we show how to correct for such sampling effects through two complementary estimators of event conditional correlation: the correlation of two random variables conditional on a given event. First, we provide, under minimal assumptions, proofs of consistency and asymptotic normality for the proposed estimators. Then, through synthetic examples, we show that these estimators behave well in small samples and yield powerful methodologies for non-linear regression as well as dependence testing. Finally, by using the two estimators in tandem, we explore counterfactual dependence regimes in a financial dataset. In doing so, we show that the contagion which took place during the 2007–2011 financial crisis cannot be explained solely by increased financial risk.
Submitted 4 January, 2016; v1 submitted 6 January, 2014;
originally announced January 2014.