-
The amplifier effect of artificial agents in social contagion
Authors:
Eric Hitz,
Mingmin Feng,
Radu Tanase,
René Algesheimer,
Manuel S. Mariani
Abstract:
Recent advances in artificial intelligence have led to the proliferation of artificial agents in social contexts, ranging from education to online social media and financial markets, among many others. The increasing rate at which artificial and human agents interact makes it urgent to understand the consequences of human-machine interactions for the propagation of new ideas, products, and behavio…
▽ More
Recent advances in artificial intelligence have led to the proliferation of artificial agents in social contexts, ranging from education to online social media and financial markets, among many others. The increasing rate at which artificial and human agents interact makes it urgent to understand the consequences of human-machine interactions for the propagation of new ideas, products, and behaviors in society. Across two distinct empirical contexts, we find here that artificial agents lead to significantly faster and wider social contagion. To this end, we replicate a choice experiment previously conducted with human subjects by using artificial agents powered by large language models (LLMs). We use the experiment's results to measure the adoption thresholds of artificial agents and their impact on the spread of social contagion. We find that artificial agents tend to exhibit lower adoption thresholds than humans, which leads to wider network-based social contagions. Our findings suggest that the increased presence of artificial agents in real-world networks may accelerate behavioral shifts, potentially in unforeseen ways.
△ Less
Submitted 10 March, 2025; v1 submitted 28 February, 2025;
originally announced February 2025.
-
Evidence of Replica Symmetry Breaking under the Nishimori conditions in epidemic inference on graphs
Authors:
Alfredo Braunstein,
Louise Budzynski,
Matteo Mariani,
Federico Ricci-Tersenghi
Abstract:
In Bayesian inference, computing the posterior distribution from the data is typically a non-trivial problem, which usually requires approximations such as mean-field approaches or numerical methods, like the Monte Carlo Markov Chain. Being a high-dimensional distribution over a set of correlated variables, the posterior distribution can undergo the notorious replica symmetry breaking transition.…
▽ More
In Bayesian inference, computing the posterior distribution from the data is typically a non-trivial problem, which usually requires approximations such as mean-field approaches or numerical methods, like the Monte Carlo Markov Chain. Being a high-dimensional distribution over a set of correlated variables, the posterior distribution can undergo the notorious replica symmetry breaking transition. When it happens, several mean-field methods and virtually every Monte Carlo scheme can not provide a reasonable approximation to the posterior and its marginals. Replica symmetry is believed to be guaranteed whenever the data is generated with known prior and likelihood distributions, namely under the so-called Nishimori conditions. In this paper, we break this belief, by providing a counter-example showing that, under the Nishimori conditions, replica symmetry breaking arises. Introducing a simple, geometrical model that can be thought of as a patient zero retrieval problem in a highly infectious regime of the epidemic Susceptible-Infectious model, we show that under the Nishimori conditions, there is evidence of replica symmetry breaking. We achieve this result by computing the instability of the replica symmetric cavity method toward the one step replica symmetry broken phase. The origin of this phenomenon -- replica symmetry breaking under the Nishimori conditions -- is likely due to the correlated disorder appearing in the epidemic models.
△ Less
Submitted 28 February, 2025; v1 submitted 18 February, 2025;
originally announced February 2025.
-
Collective dynamics behind success
Authors:
Manuel S. Mariani,
Federico Battiston,
Emőke-Ágnes Horvát,
Giacomo Livan,
Federico Musciotto,
Dashun Wang
Abstract:
Understanding the collective dynamics behind the success of ideas, products, behaviors, and social actors is critical for decision-making across diverse contexts, including hiring, funding, career choices, and the design of interventions for social change. Methodological advances and the increasing availability of big data now allow for a broader and deeper understanding of the key facets of succe…
▽ More
Understanding the collective dynamics behind the success of ideas, products, behaviors, and social actors is critical for decision-making across diverse contexts, including hiring, funding, career choices, and the design of interventions for social change. Methodological advances and the increasing availability of big data now allow for a broader and deeper understanding of the key facets of success. Recent studies unveil regularities beneath the collective dynamics of success, pinpoint underlying mechanisms, and even enable predictions of success across diverse domains, including science, technology, business, and the arts. However, this research also uncovers troubling biases that challenge meritocratic views of success. This review synthesizes the growing, cross-disciplinary literature on the collective dynamics behind success and calls for further research on cultural influences, the origins of inequalities, the role of algorithms in perpetuating them, and experimental methods to further probe causal mechanisms behind success. Ultimately, these efforts may help to better align success with desired societal values.
△ Less
Submitted 23 December, 2024;
originally announced December 2024.
-
Uncovering key predictors of high-growth firms via explainable machine learning
Authors:
Yiwei Huang,
Shuqi Xu,
Linyuan Lü,
Andrea Zaccaria,
Manuel Sebastian Mariani
Abstract:
Predicting high-growth firms has attracted increasing interest from the technological forecasting and machine learning communities. Most existing studies primarily utilize financial data for these predictions. However, research suggests that a firm's research and development activities and its network position within technological ecosystems may also serve as valuable predictors. To unpack the rel…
▽ More
Predicting high-growth firms has attracted increasing interest from the technological forecasting and machine learning communities. Most existing studies primarily utilize financial data for these predictions. However, research suggests that a firm's research and development activities and its network position within technological ecosystems may also serve as valuable predictors. To unpack the relative importance of diverse features, this paper analyzes financial and patent data from 5,071 firms, extracting three categories of features: financial features, technological features of granted patents, and network-based features derived from firms' connections to their primary technologies. By utilizing ensemble learning algorithms, we demonstrate that incorporating financial features with either technological, network-based features, or both, leads to more accurate high-growth firm predictions compared to using financial features alone. To delve deeper into the matter, we evaluate the predictive power of each individual feature within their respective categories using explainable artificial intelligence methods. Among non-financial features, the maximum economic value of a firm's granted patents and the number of patents related to a firms' primary technologies stand out for their importance. Furthermore, firm size is positively associated with high-growth probability up to a certain threshold size, after which the association plateaus. Conversely, the maximum economic value of a firm's granted patents is positively linked to high-growth probability only after a threshold value is exceeded. These findings elucidate the complex predictive role of various features in forecasting high-growth firms and could inform technological resource allocation as well as investment decisions.
△ Less
Submitted 17 August, 2024;
originally announced August 2024.
-
Integrating behavioral experimental findings into dynamical models to inform social change interventions
Authors:
Radu Tanase,
René Algesheimer,
Manuel S. Mariani
Abstract:
Addressing global challenges -- from public health to climate change -- often involves stimulating the large-scale adoption of new products or behaviors. Research traditions that focus on individual decision making suggest that achieving this objective requires better identifying the drivers of individual adoption choices. On the other hand, computational approaches rooted in complexity science fo…
▽ More
Addressing global challenges -- from public health to climate change -- often involves stimulating the large-scale adoption of new products or behaviors. Research traditions that focus on individual decision making suggest that achieving this objective requires better identifying the drivers of individual adoption choices. On the other hand, computational approaches rooted in complexity science focus on maximizing the propagation of a given product or behavior throughout social networks of interconnected adopters. The integration of these two perspectives -- although advocated by several research communities -- has remained elusive so far. Here we show how achieving this integration could inform seeding policies to facilitate the large-scale adoption of a given behavior or product. Drawing on complex contagion and discrete choice theories, we propose a method to estimate individual-level thresholds to adoption, and validate its predictive power in two choice experiments. By integrating the estimated thresholds into computational simulations, we show that state-of-the-art seeding methods for social influence maximization might be suboptimal if they neglect individual-level behavioral drivers, which can be corrected through the proposed experimental method.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Sedimentary rocks from Mediterranean drought in the Messinian age as a probe of the past cosmic ray flux
Authors:
Lorenzo Caccianiga,
Claudio Galelli,
Lorenzo Apollonio,
Federico Maria Mariani,
Paolo Magnani,
Alessandro Veutro
Abstract:
We propose the use of natural minerals as detectors to study the past flux of cosmic rays. This novel application of the \textit{paleo-detector} technique requires a specific approach as it needs samples that have been exposed to secondary cosmic rays for a well defined period of time. We suggest here the use of the evaporites formed during the desiccation of the Mediterranean sea ${\sim}6$ Myr ag…
▽ More
We propose the use of natural minerals as detectors to study the past flux of cosmic rays. This novel application of the \textit{paleo-detector} technique requires a specific approach as it needs samples that have been exposed to secondary cosmic rays for a well defined period of time. We suggest here the use of the evaporites formed during the desiccation of the Mediterranean sea ${\sim}6$ Myr ago. These minerals have been created and exposed to the air or under a shallow water basin for ${\sim}500$ kyr before being quickly submerged again by a km-scale overburden of water. We show that, by looking at the damages left in the minerals by muons in cosmic ray showers, we could detect differences in the primary cosmic ray flux during that period, as the ones expected from nearby supernova explosions, below the percent-level. We show also that little to no background from radioactive contamination and other astroparticles is expected for this kind of analysis.
△ Less
Submitted 17 December, 2024; v1 submitted 8 May, 2024;
originally announced May 2024.
-
Mineral Detection of Neutrinos and Dark Matter 2024. Proceedings
Authors:
Sebastian Baum,
Patrick Huber,
Patrick Stengel,
Natsue Abe,
Daniel G. Ang,
Lorenzo Apollonio,
Gabriela R. Araujo,
Levente Balogh,
Pranshu Bhaumik Yilda Boukhtouchen,
Joseph Bramante,
Lorenzo Caccianiga,
Andrew Calabrese-Day,
Qing Chang,
Juan I. Collar,
Reza Ebadi,
Alexey Elykov,
Katherine Freese,
Audrey Fung,
Claudio Galelli,
Arianna E. Gleason,
Mariano Guerrero Perez,
Janina Hakenmüller,
Takeshi Hanyu,
Noriko Hasebe,
Shigenobu Hirose
, et al. (35 additional authors not shown)
Abstract:
The second "Mineral Detection of Neutrinos and Dark Matter" (MDvDM'24) meeting was held January 8-11, 2024 in Arlington, VA, USA, hosted by Virginia Tech's Center for Neutrino Physics. This document collects contributions from this workshop, providing an overview of activities in the field. MDvDM'24 was the second topical workshop dedicated to the emerging field of mineral detection of neutrinos a…
▽ More
The second "Mineral Detection of Neutrinos and Dark Matter" (MDvDM'24) meeting was held January 8-11, 2024 in Arlington, VA, USA, hosted by Virginia Tech's Center for Neutrino Physics. This document collects contributions from this workshop, providing an overview of activities in the field. MDvDM'24 was the second topical workshop dedicated to the emerging field of mineral detection of neutrinos and dark matter, following a meeting hosted by IFPU in Trieste, Italy in October 2022. Mineral detectors have been proposed for a wide variety of applications, including searching for dark matter, measuring various fluxes of astrophysical neutrinos over gigayear timescales, monitoring nuclear reactors, and nuclear disarmament protocols; both as paleo-detectors using natural minerals that could have recorded the traces of nuclear recoils for timescales as long as a billion years and as detectors recording nuclear recoil events on laboratory timescales using natural or artificial minerals. Contributions to this proceedings discuss the vast physics potential, the progress in experimental studies, and the numerous challenges lying ahead on the path towards mineral detection. These include a better understanding of the formation and annealing of recoil defects in crystals; identifying the best classes of minerals and, for paleo-detectors, understanding their geology; modeling and control of the relevant backgrounds; developing, combining, and scaling up imaging and data analysis techniques; and many others. During the last years, MDvDM has grown rapidly and gained attention. Small-scale experimental efforts focused on establishing various microscopic readout techniques are underway at institutions in North America, Europe and Asia. We are looking ahead to an exciting future full of challenges to overcome, surprises to be encountered, and discoveries lying ahead of us.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Ground observations of a space laser for the assessment of its in-orbit performance
Authors:
The Pierre Auger Collaboration,
O. Lux,
I. Krisch,
O. Reitebuch,
D. Huber,
D. Wernham,
T. Parrinello,
:,
A. Abdul Halim,
P. Abreu,
M. Aglietta,
I. Allekotte,
K. Almeida Cheminant,
A. Almela,
R. Aloisio,
J. Alvarez-Muñiz,
J. Ammerman Yebra,
G. A. Anastasi,
L. Anchordoqui,
B. Andrada,
S. Andringa,
Anukriti,
L. Apollonio,
C. Aramo,
P. R. Araújo Ferreira
, et al. (358 additional authors not shown)
Abstract:
The wind mission Aeolus of the European Space Agency was a groundbreaking achievement for Earth observation. Between 2018 and 2023, the space-borne lidar instrument ALADIN onboard the Aeolus satellite measured atmospheric wind profiles with global coverage which contributed to improving the accuracy of numerical weather prediction. The precision of the wind observations, however, declined over the…
▽ More
The wind mission Aeolus of the European Space Agency was a groundbreaking achievement for Earth observation. Between 2018 and 2023, the space-borne lidar instrument ALADIN onboard the Aeolus satellite measured atmospheric wind profiles with global coverage which contributed to improving the accuracy of numerical weather prediction. The precision of the wind observations, however, declined over the course of the mission due to a progressive loss of the atmospheric backscatter signal. The analysis of the root cause was supported by the Pierre Auger Observatory in Argentina whose fluorescence detector registered the ultraviolet laser pulses emitted from the instrument in space, thereby offering an estimation of the laser energy at the exit of the instrument for several days in 2019, 2020 and 2021. The reconstruction of the laser beam not only allowed for an independent assessment of the Aeolus performance, but also helped to improve the accuracy in the determination of the laser beam's ground track on single pulse level. The results presented in this paper set a precedent for the monitoring of space lasers by ground-based telescopes and open new possibilities for the calibration of cosmic-ray observatories.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Small-Coupling Dynamic Cavity: a Bayesian mean-field framework for epidemic inference
Authors:
Alfredo Braunstein,
Giovanni Catania,
Luca Dall'Asta,
Matteo Mariani,
Fabio Mazza,
Mattia Tarabolo
Abstract:
We present the Small-Coupling Dynamic Cavity (SCDC) method, a novel generalized mean-field approximation for epidemic inference and risk assessment within a fully Bayesian framework. SCDC accounts for non-causal effects of observations and uses a graphical model representation of epidemic processes to derive self-consistent equations for edge probability marginals. A small-coupling expansion yield…
▽ More
We present the Small-Coupling Dynamic Cavity (SCDC) method, a novel generalized mean-field approximation for epidemic inference and risk assessment within a fully Bayesian framework. SCDC accounts for non-causal effects of observations and uses a graphical model representation of epidemic processes to derive self-consistent equations for edge probability marginals. A small-coupling expansion yields time-dependent cavity messages capturing individual infection probabilities and observational conditioning. With linear computational cost per iteration in the epidemic duration, SCDC is particularly efficient and valid even for recurrent epidemic processes, where standard methods are exponentially complex. Tested on synthetic networks, it matches Belief Propagation in accuracy and outperforms individual-based mean-field methods. Notably, despite being derived as a small-infectiousness expansion, SCDC maintains good accuracy even for relatively large infection probabilities. While convergence issues may arise on graphs with long-range correlations, SCDC reliably estimates risk. Future extensions include non-Markovian models and higher-order terms in the dynamic cavity framework.
△ Less
Submitted 10 April, 2025; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Statistical Mechanics of Inference in Epidemic Spreading
Authors:
Alfredo Braunstein,
Louise Budzynski,
Matteo Mariani
Abstract:
We investigate the information-theoretical limits of inference tasks in epidemic spreading on graphs in the thermodynamic limit. The typical inference tasks consist in computing observables of the posterior distribution of the epidemic model given observations taken from a ground truth (sometimes called planted) random trajectory. We can identify two main sources of quenched disorder: the graph en…
▽ More
We investigate the information-theoretical limits of inference tasks in epidemic spreading on graphs in the thermodynamic limit. The typical inference tasks consist in computing observables of the posterior distribution of the epidemic model given observations taken from a ground truth (sometimes called planted) random trajectory. We can identify two main sources of quenched disorder: the graph ensemble and the planted trajectory. The epidemic dynamics however induces non-trivial long-range correlations among individuals' states on the latter. This results in non-local correlated quenched disorder which unfortunately is typically hard to handle. To overcome this difficulty, we divide the dynamical process into two sets of variables: a set of stochastic independent variables (representing transmission delays), plus a set of correlated variables (the infection times) that depend deterministically on the first. Treating the former as quenched variables and the latter as dynamic ones, computing disorder average becomes feasible by means of the Replica Symmetric cavity method. We give theoretical predictions on the posterior probability distribution of the trajectory of each individual, conditioned to observations on the state of individuals at given times, focusing on the Susceptible Infectious (SI) model. In the Bayes-optimal condition, i.e. when true dynamic parameters are known, the inference task is expected to fall in the Replica Symmetric regime. We indeed provide predictions for the information theoretic limits of various inference tasks, in form of phase diagrams. We also identify a region, in the Bayes-Optimal setting, with strong hints of Replica Symmetry Breaking. When true parameters are unknown, we show how a maximum-likelihood procedure is able to recover them with mostly unaffected performance.
△ Less
Submitted 24 July, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
Locating the eigenshield of a network via perturbation theory
Authors:
Ming-Yang Zhou,
Manuel Sebastian Mariani,
Hao Liao,
Rui Mao,
Yi-Cheng Zhang
Abstract:
The functions of complex networks are usually determined by a small set of vital nodes. Finding the best set of vital nodes (eigenshield nodes) is critical to the network's robustness against rumor spreading and cascading failures, which makes it one of the fundamental problems in network science. The problem is challenging as it requires to maximize the influence of nodes in the set while simulta…
▽ More
The functions of complex networks are usually determined by a small set of vital nodes. Finding the best set of vital nodes (eigenshield nodes) is critical to the network's robustness against rumor spreading and cascading failures, which makes it one of the fundamental problems in network science. The problem is challenging as it requires to maximize the influence of nodes in the set while simultaneously minimizing the redundancies between the set's nodes. However, the redundancy mechanism is rarely investigated by previous studies. Here we introduce the matrix perturbation framework to find a small ``eigenshield" set of nodes that, when removed, lead to the largest drop in the network's spectral radius. We show that finding the ``eigenshield" nodes can be translated into the optimization of an objective function that simultaneously accounts for the individual influence of each node and redundancy between different nodes.
We analytically quantify the influence redundancy that explains why an important node might play an insignificant role in the ``eigenshield" node set. Extensive experiments under diverse influence maximization problems, ranging from network dismantling to spreading maximization, demonstrate that the eigenshield detection tends to significantly outperforms state-of-the-art methods across most problems. Our findings shed light on the mechanisms that may lie at the core of the function of vital nodes in complex network.
△ Less
Submitted 28 October, 2022;
originally announced October 2022.
-
Inference in conditioned dynamics through causality restoration
Authors:
Alfredo Braunstein,
Giovanni Catania,
Luca Dall'Asta,
Matteo Mariani,
Anna Paola Muntoni
Abstract:
Computing observables from conditioned dynamics is typically computationally hard, because, although obtaining independent samples efficiently from the unconditioned dynamics is usually feasible, generally most of the samples must be discarded (in a form of importance sampling) because they do not satisfy the imposed conditions. Sampling directly from the conditioned distribution is non-trivial, a…
▽ More
Computing observables from conditioned dynamics is typically computationally hard, because, although obtaining independent samples efficiently from the unconditioned dynamics is usually feasible, generally most of the samples must be discarded (in a form of importance sampling) because they do not satisfy the imposed conditions. Sampling directly from the conditioned distribution is non-trivial, as conditioning breaks the causal properties of the dynamics which ultimately renders the sampling procedure efficient. One standard way of achieving it is through a Metropolis Monte-Carlo procedure, but this procedure is normally slow and a very large number of Monte-Carlo steps is needed to obtain a small number of statistically independent samples. In this work, we propose an alternative method to produce independent samples from a conditioned distribution. The method learns the parameters of a generalized dynamical model that optimally describe the conditioned distribution in a variational sense. The outcome is an effective, unconditioned, dynamical model, from which one can trivially obtain independent samples, effectively restoring causality of the conditioned distribution. The consequences are twofold: on the one hand, it allows us to efficiently compute observables from the conditioned dynamics by simply averaging over independent samples. On the other hand, the method gives an effective unconditioned distribution which is easier to interpret. The method is flexible and can be applied virtually to any dynamics. We discuss an important application of the method, namely the problem of epidemic risk assessment from (imperfect) clinical tests, for a large family of time-continuous epidemic models endowed with a Gillespie-like sampler. We show that the method compares favorably against the state of the art, including the soft-margin approach and mean-field methods.
△ Less
Submitted 30 March, 2023; v1 submitted 18 October, 2022;
originally announced October 2022.
-
Forecasting countries' gross domestic product from patent data
Authors:
Yucheng Ye,
Shuqi Xu,
Manuel Sebastian Mariani,
Linyuan Lü
Abstract:
Recent strides in economic complexity have shown that the future economic development of nations can be predicted with a single "economic fitness" variable, which captures countries' competitiveness in international trade. The predictions by this low-dimensional approach could match or even outperform predictions based on much more sophisticated methods, such as those by the International Monetary…
▽ More
Recent strides in economic complexity have shown that the future economic development of nations can be predicted with a single "economic fitness" variable, which captures countries' competitiveness in international trade. The predictions by this low-dimensional approach could match or even outperform predictions based on much more sophisticated methods, such as those by the International Monetary Fund (IMF). However, all prior works in economic complexity aimed to quantify countries' fitness from World Trade export data, without considering the possibility to infer countries' potential for growth from alternative sources of data. Here, motivated by the long-standing relationship between technological development and economic growth, we aim to forecast countries' growth from patent data. Specifically, we construct a citation network between countries from the European Patent Office (EPO) dataset. Initial results suggest that the H-index centrality in this network is a potential candidate to gauge national economic performance. To validate this conjecture, we construct a two-dimensional plane defined by the H-index and GDP per capita, and use a forecasting method based on dynamical systems to test the predicting accuracy of the H-index. We find that the predictions based on the H-index-GDP plane outperform the predictions by IMF by approximately 35%, and they marginally outperform those by the economic fitness extracted from trade data. Our results could inspire further attempts to identify predictors of national growth from different sources of data related to scientific and technological innovation.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
The different structure of economic ecosystems at the scales of companies and countries
Authors:
Dario Laudati,
Manuel S. Mariani,
Luciano Pietronero,
Andrea Zaccaria
Abstract:
A key element to understand complex systems is the relationship between the spatial scale of investigation and the structure of the interrelation among its elements. When it comes to economic systems, it is now well-known that the country-product bipartite network exhibits a nested structure, which is the foundation of different algorithms that have been used to scientifically investigate countrie…
▽ More
A key element to understand complex systems is the relationship between the spatial scale of investigation and the structure of the interrelation among its elements. When it comes to economic systems, it is now well-known that the country-product bipartite network exhibits a nested structure, which is the foundation of different algorithms that have been used to scientifically investigate countries' development and forecast national economic growth. Changing the subject from countries to companies, a significantly different scenario emerges. Through the analysis of a unique dataset of Italian firms' exports and a worldwide dataset comprising countries' exports, here we find that, while a globally nested structure is observed at the country level, a local, in-block nested structure emerges at the level of firms. Remarkably, this in-block nestedness is statistically significant with respect to suitable null models and the algorithmic partitions of products into blocks have a high correspondence with exogenous product classifications. These findings lay a solid foundation for developing a scientific approach based on the physics of complex systems to the analysis of companies, which has been lacking until now.
△ Less
Submitted 3 February, 2022;
originally announced February 2022.
-
Citations or dollars? Early signals of a firm's research success
Authors:
Shuqi Xu,
Manuel S. Mariani,
Linyuan Lü,
Lorenzo Napolitano,
Emanuele Pugliese,
Andrea Zaccaria
Abstract:
Scientific and technological progress is largely driven by firms in many domains, including artificial intelligence and vaccine development. However, we do not know yet whether the success of firms' research activities exhibits dynamic regularities and some degree of predictability. By inspecting the research lifecycles of 7,440 firms, we find that the economic value of a firm's early patents is a…
▽ More
Scientific and technological progress is largely driven by firms in many domains, including artificial intelligence and vaccine development. However, we do not know yet whether the success of firms' research activities exhibits dynamic regularities and some degree of predictability. By inspecting the research lifecycles of 7,440 firms, we find that the economic value of a firm's early patents is an accurate predictor of various dimensions of a firm's future research success. At the same time, a smaller set of future top-performers do not generate early patents of high economic value, but they are detectable via the technological value of their early patents. Importantly, the observed predictability cannot be explained by a cumulative advantage mechanism, and the observed heterogeneity of the firms' temporal success patterns markedly differs from patterns previously observed for individuals' research careers. Our results uncover the dynamical regularities of the research success of firms, and they could inform managerial strategies as well as policies to promote entrepreneurship and accelerate human progress.
△ Less
Submitted 31 July, 2021;
originally announced August 2021.
-
Detecting new edge types in a temporal network model
Authors:
Wenjie Jia,
Manuel S. Mariani,
Linyuan Lü,
Tao Jiang
Abstract:
Networks representing complex systems in nature and society usually involve multiple interaction types. These types suggest essential information on the interactions between components, but not all of the existing types are usually discovered. Therefore, detecting the undiscovered edge types is crucial for deepening our understanding of the network structure. Although previous studies have discuss…
▽ More
Networks representing complex systems in nature and society usually involve multiple interaction types. These types suggest essential information on the interactions between components, but not all of the existing types are usually discovered. Therefore, detecting the undiscovered edge types is crucial for deepening our understanding of the network structure. Although previous studies have discussed the edge label detection problem, we still lack effective methods for uncovering previously-undetected edge types. Here, we develop an effective technique to detect undiscovered new edge types in networks by leveraging a novel temporal network model. Both analytical and numerical results show that the prediction accuracy of our method is perfect when the model networks' time parameter approaches infinity. Furthermore, we find that when time is finite, our method is still significantly more accurate than the baseline.
△ Less
Submitted 26 April, 2021;
originally announced April 2021.
-
The fragility of opinion formation in a complex world
Authors:
Matúš Medo,
Manuel S. Mariani,
Linyuan Lü
Abstract:
With vast amounts of high-quality information at our fingertips, how is it possible that many people believe that the Earth is flat and vaccination harmful? Motivated by this question, we quantify the implications of an opinion formation mechanism whereby an uninformed observer gradually forms opinions about a world composed of subjects interrelated by a signed network of mutual trust and distrust…
▽ More
With vast amounts of high-quality information at our fingertips, how is it possible that many people believe that the Earth is flat and vaccination harmful? Motivated by this question, we quantify the implications of an opinion formation mechanism whereby an uninformed observer gradually forms opinions about a world composed of subjects interrelated by a signed network of mutual trust and distrust. We show numerically and analytically that the observer's resulting opinions are highly inconsistent (they tend to be independent of the observer's initial opinions) and unstable (they exhibit wide stochastic variations). Opinion inconsistency and instability increase with the world complexity represented by the number of subjects, which can be prevented by suitably expanding the observer's initial amount of information. Our findings imply that even an individual who initially trusts credible information sources may end up trusting the deceptive ones if at least a small number of trust relations exist between the credible and deceptive sources.
△ Less
Submitted 23 October, 2020;
originally announced October 2020.
-
Network-based ranking in social systems: three challenges
Authors:
Manuel S. Mariani,
Linyuan Lü
Abstract:
Ranking algorithms are pervasive in our increasingly digitized societies, with important real-world applications including recommender systems, search engines, and influencer marketing practices. From a network science perspective, network-based ranking algorithms solve fundamental problems related to the identification of vital nodes for the stability and dynamics of a complex system. Despite the…
▽ More
Ranking algorithms are pervasive in our increasingly digitized societies, with important real-world applications including recommender systems, search engines, and influencer marketing practices. From a network science perspective, network-based ranking algorithms solve fundamental problems related to the identification of vital nodes for the stability and dynamics of a complex system. Despite the ubiquitous and successful applications of these algorithms, we argue that our understanding of their performance and their applications to real-world problems face three fundamental challenges: (i) Rankings might be biased by various factors; (2) their effectiveness might be limited to specific problems; and (3) agents' decisions driven by rankings might result in potentially vicious feedback mechanisms and unhealthy systemic consequences. Methods rooted in network science and agent-based modeling can help us to understand and overcome these challenges.
△ Less
Submitted 29 May, 2020;
originally announced May 2020.
-
SiPM-matrix readout of two-phase argon detectors using electroluminescence in the visible and near infrared range
Authors:
The DarkSide collaboration,
C. E. Aalseth,
S. Abdelhakim,
P. Agnes,
R. Ajaj,
I. F. M. Albuquerque,
T. Alexander,
A. Alici,
A. K. Alton,
P. Amaudruz,
F. Ameli,
J. Anstey,
P. Antonioli,
M. Arba,
S. Arcelli,
R. Ardito,
I. J. Arnquist,
P. Arpaia,
D. M. Asner,
A. Asunskis,
M. Ave,
H. O. Back,
V. Barbaryan,
A. Barrado Olmedo,
G. Batignani
, et al. (290 additional authors not shown)
Abstract:
Proportional electroluminescence (EL) in noble gases is used in two-phase detectors for dark matter searches to record (in the gas phase) the ionization signal induced by particle scattering in the liquid phase. The "standard" EL mechanism is considered to be due to noble gas excimer emission in the vacuum ultraviolet (VUV). In addition, there are two alternative mechanisms, producing light in the…
▽ More
Proportional electroluminescence (EL) in noble gases is used in two-phase detectors for dark matter searches to record (in the gas phase) the ionization signal induced by particle scattering in the liquid phase. The "standard" EL mechanism is considered to be due to noble gas excimer emission in the vacuum ultraviolet (VUV). In addition, there are two alternative mechanisms, producing light in the visible and near infrared (NIR) ranges. The first is due to bremsstrahlung of electrons scattered on neutral atoms ("neutral bremsstrahlung", NBrS). The second, responsible for electron avalanche scintillation in the NIR at higher electric fields, is due to transitions between excited atomic states. In this work, we have for the first time demonstrated two alternative techniques of the optical readout of two-phase argon detectors, in the visible and NIR range, using a silicon photomultiplier matrix and electroluminescence due to either neutral bremsstrahlung or avalanche scintillation. The amplitude yield and position resolution were measured for these readout techniques, which allowed to assess the detection threshold for electron and nuclear recoils in two-phase argon detectors for dark matter searches. To the best of our knowledge, this is the first practical application of the NBrS effect in detection science.
△ Less
Submitted 26 February, 2021; v1 submitted 4 April, 2020;
originally announced April 2020.
-
Absence of a resolution limit in in-block nestedness
Authors:
Manuel S. Mariani,
María J. Palazzi,
Albert Solé-Ribalta,
Javier Borge-Holthoefer,
Claudio J. Tessone
Abstract:
Originally a speculative pattern in ecological networks, the hybrid or compound nested-modular pattern has been confirmed, during the last decade, as a relevant structural arrangement that emerges in a variety of contexts --in ecological mutualistic system and beyond. This implies shifting the focus from the measurement of nestedness as a global property (macro level), to the detection of blocks (…
▽ More
Originally a speculative pattern in ecological networks, the hybrid or compound nested-modular pattern has been confirmed, during the last decade, as a relevant structural arrangement that emerges in a variety of contexts --in ecological mutualistic system and beyond. This implies shifting the focus from the measurement of nestedness as a global property (macro level), to the detection of blocks (meso level) that internally exhibit a high degree of nestedness. Unfortunately, the availability and understanding of the methods to properly detect in-block nested partitions lie behind the empirical findings: while a precise quality function of in-block nestedness has been proposed, we lack an understanding of its possible inherent constraints. Specifically, while it is well known that Newman-Girvan's modularity, and related quality functions, notoriously suffer from a resolution limit that impairs their ability to detect small blocks, the potential existence of resolution limits for in-block nestedness is unexplored. Here, we provide empirical, numerical and analytical evidence that the in-block nestedness function lacks a resolution limit, and thus our capacity to detect correct partitions in networks via its maximization depends solely on the accuracy of the optimization algorithms.
△ Less
Submitted 19 February, 2020;
originally announced February 2020.
-
Design and construction of a new detector to measure ultra-low radioactive-isotope contamination of argon
Authors:
The DarkSide Collaboration,
C. E. Aalseth,
S. Abdelhakim,
F. Acerbi,
P. Agnes,
R. Ajaj,
I. F. M. Albuquerque,
T. Alexander,
A. Alici,
A. K. Alton,
P. Amaudruz,
F. Ameli,
J. Anstey,
P. Antonioli,
M. Arba,
S. Arcelli,
R. Ardito,
I. J. Arnquist,
P. Arpaia,
D. M. Asner,
A. Asunskis,
M. Ave,
H. O. Back,
A. Barrado Olmedo,
G. Batignani
, et al. (306 additional authors not shown)
Abstract:
Large liquid argon detectors offer one of the best avenues for the detection of galactic weakly interacting massive particles (WIMPs) via their scattering on atomic nuclei. The liquid argon target allows exquisite discrimination between nuclear and electron recoil signals via pulse-shape discrimination of the scintillation signals. Atmospheric argon (AAr), however, has a naturally occurring radioa…
▽ More
Large liquid argon detectors offer one of the best avenues for the detection of galactic weakly interacting massive particles (WIMPs) via their scattering on atomic nuclei. The liquid argon target allows exquisite discrimination between nuclear and electron recoil signals via pulse-shape discrimination of the scintillation signals. Atmospheric argon (AAr), however, has a naturally occurring radioactive isotope, $^{39}$Ar, a $β$ emitter of cosmogenic origin. For large detectors, the atmospheric $^{39}$Ar activity poses pile-up concerns. The use of argon extracted from underground wells, deprived of $^{39}$Ar, is key to the physics potential of these experiments. The DarkSide-20k dark matter search experiment will operate a dual-phase time projection chamber with 50 tonnes of radio-pure underground argon (UAr), that was shown to be depleted of $^{39}$Ar with respect to AAr by a factor larger than 1400. Assessing the $^{39}$Ar content of the UAr during extraction is crucial for the success of DarkSide-20k, as well as for future experiments of the Global Argon Dark Matter Collaboration (GADMC). This will be carried out by the DArT in ArDM experiment, a small chamber made with extremely radio-pure materials that will be placed at the centre of the ArDM detector, in the Canfranc Underground Laboratory (LSC) in Spain. The ArDM LAr volume acts as an active veto for background radioactivity, mostly $γ$-rays from the ArDM detector materials and the surrounding rock. This article describes the DArT in ArDM project, including the chamber design and construction, and reviews the background required to achieve the expected performance of the detector.
△ Less
Submitted 22 January, 2020;
originally announced January 2020.
-
Simple regularities in the dynamics of online news impact
Authors:
Matúš Medo,
Manuel S. Mariani,
Linyuan Lü
Abstract:
Online news can quickly reach and affect millions of people, yet we do not know yet whether there exist potential dynamical regularities that govern their impact on the public. We use data from two major news outlets, BBC and New York Times, where the number of user comments can be used as a proxy of news impact. We find that the impact dynamics of online news articles does not exhibit popularity…
▽ More
Online news can quickly reach and affect millions of people, yet we do not know yet whether there exist potential dynamical regularities that govern their impact on the public. We use data from two major news outlets, BBC and New York Times, where the number of user comments can be used as a proxy of news impact. We find that the impact dynamics of online news articles does not exhibit popularity patterns found in many other social and information systems. In particular, we find that a simple exponential distribution yields a better fit to the empirical news impact distributions than a power-law distribution. This observation is explained by the lack or limited influence of the otherwise omnipresent rich-get-richer mechanism in the analyzed data. The temporal dynamics of the news impact exhibits a universal exponential decay which allows us to collapse individual news trajectories into an elementary single curve. We also show how daily variations of user activity directly influence the dynamics of the article impact. Our findings challenge the universal applicability of popularity dynamics patterns found in other social contexts.
△ Less
Submitted 22 January, 2021; v1 submitted 16 January, 2020;
originally announced January 2020.
-
Unbiased evaluation of ranking metrics reveals consistent performance in science and technology citation data
Authors:
Shuqi Xu,
Manuel Sebastian Mariani,
Linyuan Lü,
Matúš Medo
Abstract:
Despite the increasing use of citation-based metrics for research evaluation purposes, we do not know yet which metrics best deliver on their promise to gauge the significance of a scientific paper or a patent. We assess 17 network-based metrics by their ability to identify milestone papers and patents in three large citation datasets. We find that traditional information-retrieval evaluation metr…
▽ More
Despite the increasing use of citation-based metrics for research evaluation purposes, we do not know yet which metrics best deliver on their promise to gauge the significance of a scientific paper or a patent. We assess 17 network-based metrics by their ability to identify milestone papers and patents in three large citation datasets. We find that traditional information-retrieval evaluation metrics are strongly affected by the interplay between the age distribution of the milestone items and age biases of the evaluated metrics. Outcomes of these metrics are therefore not representative of the metrics' ranking ability. We argue in favor of a modified evaluation procedure that explicitly penalizes biased metrics and allows us to reveal metrics' performance patterns that are consistent across the datasets. PageRank and LeaderRank turn out to be the best-performing ranking metrics when their age bias is suppressed by a simple transformation of the scores that they produce, whereas other popular metrics, including citation count, HITS and Collective Influence, produce significantly worse ranking results.
△ Less
Submitted 15 January, 2020;
originally announced January 2020.
-
The wisdom of the few: Predicting collective success from individual behavior
Authors:
Manuel S. Mariani,
Yanina Gimenez,
Jorge Brea,
Martin Minnoni,
René Algesheimer,
Claudio J. Tessone
Abstract:
Can we predict top-performing products, services, or businesses by only monitoring the behavior of a small set of individuals? Although most previous studies focused on the predictive power of "hub" individuals with many social contacts, which sources of customer behavioral data are needed to address this question remains unclear, mostly due to the scarcity of available datasets that simultaneousl…
▽ More
Can we predict top-performing products, services, or businesses by only monitoring the behavior of a small set of individuals? Although most previous studies focused on the predictive power of "hub" individuals with many social contacts, which sources of customer behavioral data are needed to address this question remains unclear, mostly due to the scarcity of available datasets that simultaneously capture individuals' purchasing patterns and social interactions. Here, we address this question in a unique, large-scale dataset that combines individuals' credit-card purchasing history with their social and mobility traits across an entire nation. Surprisingly, we find that the purchasing history alone enables the detection of small sets of ``discoverers" whose early purchases offer reliable success predictions for the brick-and-mortar stores they visit. In contrast with the assumptions by most existing studies on word-of-mouth processes, the hubs selected by social network centrality are not consistently predictive of success. Our findings show that companies and organizations with access to large-scale purchasing data can detect the discoverers and leverage their behavior to anticipate market trends, without the need for social network data.
△ Less
Submitted 9 June, 2020; v1 submitted 14 January, 2020;
originally announced January 2020.
-
Nestedness in complex networks: Observation, emergence, and implications
Authors:
Manuel Sebastian Mariani,
Zhuo-Ming Ren,
Jordi Bascompte,
Claudio Juan Tessone
Abstract:
The observed architecture of ecological and socio-economic networks differs significantly from that of random networks. From a network science standpoint, non-random structural patterns observed in real networks call for an explanation of their emergence and an understanding of their potential systemic consequences. This article focuses on one of these patterns: nestedness. Given a network of inte…
▽ More
The observed architecture of ecological and socio-economic networks differs significantly from that of random networks. From a network science standpoint, non-random structural patterns observed in real networks call for an explanation of their emergence and an understanding of their potential systemic consequences. This article focuses on one of these patterns: nestedness. Given a network of interacting nodes, nestedness can be described as the tendency for nodes to interact with subsets of the interaction partners of better-connected nodes. Known since more than $80$ years in biogeography, nestedness has been found in systems as diverse as ecological mutualistic organizations, world trade, inter-organizational relations, among many others. This review article focuses on three main pillars: the existing methodologies to observe nestedness in networks; the main theoretical mechanisms conceived to explain the emergence of nestedness in ecological and socio-economic networks; the implications of a nested topology of interactions for the stability and feasibility of a given interacting system. We survey results from variegated disciplines, including statistical physics, graph theory, ecology, and theoretical economics. Nestedness was found to emerge both in bipartite networks and, more recently, in unipartite ones; this review is the first comprehensive attempt to unify both streams of studies, usually disconnected from each other. We believe that the truly interdisciplinary endeavour -- while rooted in a complex systems perspective -- may inspire new models and algorithms whose realm of application will undoubtedly transcend disciplinary boundaries.
△ Less
Submitted 18 May, 2019;
originally announced May 2019.
-
Temporal similarity metrics for latent network reconstruction: The role of time-lag decay
Authors:
Hao Liao,
Ming-Kai Liu,
Manuel Sebastian Mariani,
Mingyang Zhou,
Xingtong Wu
Abstract:
When investigating the spreading of a piece of information or the diffusion of an innovation, we often lack information on the underlying propagation network. Reconstructing the hidden propagation paths based on the observed diffusion process is a challenging problem which has recently attracted attention from diverse research fields. To address this reconstruction problem, based on static similar…
▽ More
When investigating the spreading of a piece of information or the diffusion of an innovation, we often lack information on the underlying propagation network. Reconstructing the hidden propagation paths based on the observed diffusion process is a challenging problem which has recently attracted attention from diverse research fields. To address this reconstruction problem, based on static similarity metrics commonly used in the link prediction literature, we introduce new node-node temporal similarity metrics. The new metrics take as input the time-series of multiple independent spreading processes, based on the hypothesis that two nodes are more likely to be connected if they were often infected at similar points in time. This hypothesis is implemented by introducing a time-lag function which penalizes distant infection times. We find that the choice of this time-lag strongly affects the metrics' reconstruction accuracy, depending on the network's clustering coefficient and we provide an extensive comparative analysis of static and temporal similarity metrics for network reconstruction. Our findings shed new light on the notion of similarity between pairs of nodes in complex networks.
△ Less
Submitted 4 April, 2019;
originally announced April 2019.
-
Fast influencers in complex networks
Authors:
Fang Zhou,
Linyuan Lü,
Manuel Sebastian Mariani
Abstract:
Influential nodes in complex networks are typically defined as those nodes that maximize the asymptotic reach of a spreading process of interest. However, for practical applications such as viral marketing and online information spreading, one is often interested in maximizing the reach of the process in a short amount of time. The traditional definition of influencers in network-related studies f…
▽ More
Influential nodes in complex networks are typically defined as those nodes that maximize the asymptotic reach of a spreading process of interest. However, for practical applications such as viral marketing and online information spreading, one is often interested in maximizing the reach of the process in a short amount of time. The traditional definition of influencers in network-related studies from diverse research fields narrows down the focus to the late-time state of the spreading processes, leaving the following question unsolved: which nodes are able to initiate large-scale spreading processes, in a limited amount of time? Here, we find that there is a fundamental difference between the nodes -- which we call "fast influencers" -- that initiate the largest-reach processes in a short amount of time, and the traditional, "late-time" influencers. Stimulated by this observation, we provide an extensive benchmarking of centrality metrics with respect to their ability to identify both the fast and late-time influencers. We find that local network properties can be used to uncover the fast influencers. In particular, a parsimonious, local centrality metric (which we call social capital) achieves optimal or nearly-optimal performance in the fast influencer identification for all the analyzed empirical networks. Local metrics tend to be also competitive in the traditional, late-time influencer identification task.
△ Less
Submitted 15 March, 2019;
originally announced March 2019.
-
Optimal timescale for community detection in growing networks
Authors:
Matus Medo,
An Zeng,
Yi-Cheng Zhang,
Manuel S. Mariani
Abstract:
Time-stamped data are increasingly available for many social, economic, and information systems that can be represented as networks growing with time. The World Wide Web, social contact networks, and citation networks of scientific papers and online news articles, for example, are of this kind. Static methods can be inadequate for the analysis of growing networks as they miss essential information…
▽ More
Time-stamped data are increasingly available for many social, economic, and information systems that can be represented as networks growing with time. The World Wide Web, social contact networks, and citation networks of scientific papers and online news articles, for example, are of this kind. Static methods can be inadequate for the analysis of growing networks as they miss essential information on the system's dynamics. At the same time, time-aware methods require the choice of an observation timescale, yet we lack principled ways to determine it. We focus on the popular community detection problem which aims to partition a network's nodes into meaningful groups. We use a multi-layer quality function to show, on both synthetic and real datasets, that the observation timescale that leads to optimal communities is tightly related to the system's intrinsic aging timescale that can be inferred from the time-stamped network data. The use of temporal information leads to drastically different conclusions on the community structure of real information networks, which challenges the current understanding of the large-scale organization of growing networks. Our findings indicate that before attempting to assess structural patterns of evolving networks, it is vital to uncover the timescales of the dynamical processes that generated them.
△ Less
Submitted 1 August, 2019; v1 submitted 13 September, 2018;
originally announced September 2018.
-
The long-term impact of ranking algorithms in growing networks
Authors:
Shilun Zhang,
Matúš Medo,
Linyuan Lü,
Manuel Sebastian Mariani
Abstract:
When we search online for content, we are constantly exposed to rankings. For example, web search results are presented as a ranking, and online bookstores often show us lists of best-selling books. While popularity-based ranking algorithms (like Google's PageRank) have been extensively studied in previous works, we still lack a clear understanding of their potential systemic consequences. In this…
▽ More
When we search online for content, we are constantly exposed to rankings. For example, web search results are presented as a ranking, and online bookstores often show us lists of best-selling books. While popularity-based ranking algorithms (like Google's PageRank) have been extensively studied in previous works, we still lack a clear understanding of their potential systemic consequences. In this work, we fill this gap by introducing a new model of network growth that allows us to compare the properties of the networks generated under the influence of different ranking algorithms. We show that by correcting for the omnipresent age bias of popularity-based ranking algorithms, the resulting networks exhibit a significantly larger agreement between the nodes' inherent quality and their long-term popularity, and a less concentrated popularity distribution. To further promote popularity diversity, we introduce and validate a perturbation of the original rankings where a small number of randomly-selected nodes are promoted to the top of the ranking. Our findings move the first steps toward a model-based understanding of the long-term impact of popularity-based ranking algorithms, and could be used as an informative tool for the design of improved information filtering tools.
△ Less
Submitted 19 November, 2018; v1 submitted 31 May, 2018;
originally announced May 2018.
-
Influencers identification in complex networks through reaction-diffusion dynamics
Authors:
Flavio Iannelli,
Manuel Sebastian Mariani,
Igor M. Sokolov
Abstract:
A pivotal idea in network science, marketing research and innovation diffusion theories is that a small group of nodes -- called influencers -- have the largest impact on social contagion and epidemic processes in networks. Despite the long-standing interest in the influencers identification problem in socio-economic and biological networks, there is not yet agreement on which is the best identifi…
▽ More
A pivotal idea in network science, marketing research and innovation diffusion theories is that a small group of nodes -- called influencers -- have the largest impact on social contagion and epidemic processes in networks. Despite the long-standing interest in the influencers identification problem in socio-economic and biological networks, there is not yet agreement on which is the best identification strategy. State-of-the-art strategies are typically based either on heuristic centrality metrics or on analytic arguments that only hold for specific network topologies or peculiar dynamical regimes. Here, we leverage the recently introduced random-walk effective distance -- a topological metric that estimates almost perfectly the arrival time of diffusive spreading processes on networks -- to introduce a new centrality metric which quantifies how close a node is to the other nodes. We show that the new centrality metric significantly outperforms state-of-the-art metrics in detecting the influencers for global contagion processes. Our findings reveal the essential role of the network effective distance for the influencers identification and lead us closer to the optimal solution of the problem.
△ Less
Submitted 14 November, 2018; v1 submitted 3 March, 2018;
originally announced March 2018.
-
Revealing In-Block Nestedness: detection and benchmarking
Authors:
Albert Solé-Ribalta,
Claudio J. Tessone,
Manuel S. Mariani,
Javier Borge-Holthoefer
Abstract:
As new instances of nested organization --beyond ecological networks-- are discovered, scholars are debating around the co-existence of two apparently incompatible macroscale architectures: nestedness and modularity. The discussion is far from being solved, mainly for two reasons. First, nestedness and modularity appear to emerge from two contradictory dynamics, cooperation and competition. Second…
▽ More
As new instances of nested organization --beyond ecological networks-- are discovered, scholars are debating around the co-existence of two apparently incompatible macroscale architectures: nestedness and modularity. The discussion is far from being solved, mainly for two reasons. First, nestedness and modularity appear to emerge from two contradictory dynamics, cooperation and competition. Second, existing methods to assess the presence of nestedness and modularity are flawed when it comes to the evaluation of concurrently nested and modular structures. In this work, we tackle the latter problem, presenting the concept of \textit{in-block nestedness}, a structural property determining to what extent a network is composed of blocks whose internal connectivity exhibits nestedness. We then put forward a set of optimization methods that allow us to identify such organization successfully, both in synthetic and in a large number of real networks. These findings challenge our understanding of the topology of ecological and social systems, calling for new models to explain how such patterns emerge.
△ Less
Submitted 17 January, 2018;
originally announced January 2018.
-
Early identification of important patents through network centrality
Authors:
Manuel Sebastian Mariani,
Matus Medo,
François Lafond
Abstract:
One of the most challenging problems in technological forecasting is to identify as early as possible those technologies that have the potential to lead to radical changes in our society. In this paper, we use the US patent citation network (1926-2010) to test our ability to early identify a list of historically significant patents through citation network analysis. We show that in order to effect…
▽ More
One of the most challenging problems in technological forecasting is to identify as early as possible those technologies that have the potential to lead to radical changes in our society. In this paper, we use the US patent citation network (1926-2010) to test our ability to early identify a list of historically significant patents through citation network analysis. We show that in order to effectively uncover these patents shortly after they are issued, we need to go beyond raw citation counts and take into account both the citation network topology and temporal information. In particular, an age-normalized measure of patent centrality, called rescaled PageRank, allows us to identify the significant patents earlier than citation count and PageRank score. In addition, we find that while high-impact patents tend to rely on other high-impact patents in a similar way as scientific papers, the patents' citation dynamics is significantly slower than that of papers, which makes the early identification of significant patents more challenging than that of significant papers.
△ Less
Submitted 25 October, 2017;
originally announced October 2017.
-
DarkSide-20k: A 20 Tonne Two-Phase LAr TPC for Direct Dark Matter Detection at LNGS
Authors:
C. E. Aalseth,
F. Acerbi,
P. Agnes,
I. F. M. Albuquerque,
T. Alexander,
A. Alici,
A. K. Alton,
P. Antonioli,
S. Arcelli,
R. Ardito,
I. J. Arnquist,
D. M. Asner,
M. Ave,
H. O. Back,
A. I. Barrado Olmedo,
G. Batignani,
E. Bertoldo,
S. Bettarini,
M. G. Bisogni,
V. Bocci,
A. Bondar,
G. Bonfini,
W. Bonivento,
M. Bossa,
B. Bottino
, et al. (260 additional authors not shown)
Abstract:
Building on the successful experience in operating the DarkSide-50 detector, the DarkSide Collaboration is going to construct DarkSide-20k, a direct WIMP search detector using a two-phase Liquid Argon Time Projection Chamber (LArTPC) with an active (fiducial) mass of 23 t (20 t). The DarkSide-20k LArTPC will be deployed within a shield/veto with a spherical Liquid Scintillator Veto (LSV) inside a…
▽ More
Building on the successful experience in operating the DarkSide-50 detector, the DarkSide Collaboration is going to construct DarkSide-20k, a direct WIMP search detector using a two-phase Liquid Argon Time Projection Chamber (LArTPC) with an active (fiducial) mass of 23 t (20 t). The DarkSide-20k LArTPC will be deployed within a shield/veto with a spherical Liquid Scintillator Veto (LSV) inside a cylindrical Water Cherenkov Veto (WCV). Operation of DarkSide-50 demonstrated a major reduction in the dominant $^{39}$Ar background when using argon extracted from an underground source, before applying pulse shape analysis. Data from DarkSide-50, in combination with MC simulation and analytical modeling, shows that a rejection factor for discrimination between electron and nuclear recoils of $\gt3\times10^9$ is achievable. This, along with the use of the veto system, is the key to unlocking the path to large LArTPC detector masses, while maintaining an "instrumental background-free" experiment, an experiment in which less than 0.1 events (other than $ν$-induced nuclear recoils) is expected to occur within the WIMP search region during the planned exposure. DarkSide-20k will have ultra-low backgrounds than can be measured in situ. This will give sensitivity to WIMP-nucleon cross sections of $1.2\times10^{-47}$ cm$^2$ ($1.1\times10^{-46}$ cm$^2$) for WIMPs of $1$ TeV$/c^2$ ($10$ TeV$/c^2$) mass, to be achieved during a 5 yr run producing an exposure of 100 t yr free from any instrumental background. DarkSide-20k could then extend its operation to a decade, increasing the exposure to 200 t yr, reaching a sensitivity of $7.4\times10^{-48}$ cm$^2$ ($6.9\times10^{-47}$ cm$^2$) for WIMPs of $1$ TeV$/c^2$ ($10$ TeV$/c^2$) mass.
△ Less
Submitted 25 July, 2017;
originally announced July 2017.
-
Cryogenic Characterization of FBK RGB-HD SiPMs
Authors:
C. E. Aalseth,
F. Acerbi,
P. Agnes,
I. F. M. Albuquerque,
T. Alexander,
A. Alici,
A. K. Alton,
P. Ampudia,
P. Antonioli,
S. Arcelli,
R. Ardito,
I. J. Arnquist,
D. M. Asner,
H. O. Back,
G. Batignani,
E. Bertoldo,
S. Bettarini,
M. G. Bisogni,
V. Bocci,
A. Bondar,
G. Bonfini,
W. Bonivento,
M. Bossa,
B. Bottino,
R. Bunker
, et al. (246 additional authors not shown)
Abstract:
We report on the cryogenic characterization of Red Green Blue - High Density (RGB-HD) SiPMs developed at Fondazione Bruno Kessler (FBK) as part of the DarkSide program of dark matter searches with liquid argon time projection chambers. A dedicated setup was used to measure the primary dark noise, the correlated noise, and the gain of the SiPMs at varying temperatures. A custom-made data acquisitio…
▽ More
We report on the cryogenic characterization of Red Green Blue - High Density (RGB-HD) SiPMs developed at Fondazione Bruno Kessler (FBK) as part of the DarkSide program of dark matter searches with liquid argon time projection chambers. A dedicated setup was used to measure the primary dark noise, the correlated noise, and the gain of the SiPMs at varying temperatures. A custom-made data acquisition system and analysis software were used to precisely characterize these parameters. We demonstrate that FBK RGB-HD SiPMs with low quenching resistance (RGB-HD-LR$_q$) can be operated from 40 K to 300 K with gains in the range $10^5$ to $10^6$ and noise rates on the order of a few Hz/mm$^2$.
△ Less
Submitted 12 September, 2017; v1 submitted 19 May, 2017;
originally announced May 2017.
-
Ranking in evolving complex networks
Authors:
Hao Liao,
Manuel Sebastian Mariani,
Matus Medo,
Yi-Cheng Zhang,
Ming-Yang Zhou
Abstract:
Complex networks have emerged as a simple yet powerful framework to represent and analyze a wide range of complex systems. The problem of ranking the nodes and the edges in complex networks is critical for a broad range of real-world problems because it affects how we access online information and products, how success and talent are evaluated in human activities, and how scarce resources are allo…
▽ More
Complex networks have emerged as a simple yet powerful framework to represent and analyze a wide range of complex systems. The problem of ranking the nodes and the edges in complex networks is critical for a broad range of real-world problems because it affects how we access online information and products, how success and talent are evaluated in human activities, and how scarce resources are allocated by companies and policymakers, among others. This calls for a deep understanding of how existing ranking algorithms perform, and which are their possible biases that may impair their effectiveness. Well-established ranking algorithms (such as the popular Google's PageRank) are static in nature and, as a consequence, they exhibit important shortcomings when applied to real networks that rapidly evolve in time. The recent advances in the understanding and modeling of evolving networks have enabled the development of a wide and diverse range of ranking algorithms that take the temporal dimension into account. The aim of this review is to survey the existing ranking algorithms, both static and time-aware, and their applications to evolving networks. We emphasize both the impact of network evolution on well-established static algorithms and the benefits from including the temporal dimension for tasks such as prediction of real network traffic, prediction of future links, and identification of highly-significant nodes.
△ Less
Submitted 26 April, 2017;
originally announced April 2017.
-
Quantifying and suppressing ranking bias in a large citation network
Authors:
Giacomo Vaccario,
Matus Medo,
Nicolas Wider,
Manuel Sebastian Mariani
Abstract:
It is widely recognized that citation counts for papers from different fields cannot be directly compared because different scientific fields adopt different citation practices. Citation counts are also strongly biased by paper age since older papers had more time to attract citations. Various procedures aim at suppressing these biases and give rise to new normalized indicators, such as the relati…
▽ More
It is widely recognized that citation counts for papers from different fields cannot be directly compared because different scientific fields adopt different citation practices. Citation counts are also strongly biased by paper age since older papers had more time to attract citations. Various procedures aim at suppressing these biases and give rise to new normalized indicators, such as the relative citation count. We use a large citation dataset from Microsoft Academic Graph and a new statistical framework based on the Mahalanobis distance to show that the rankings by well known indicators, including the relative citation count and Google's PageRank score, are significantly biased by paper field and age. We propose a general normalization procedure motivated by the $z$-score which produces much less biased rankings when applied to citation count and PageRank score.
△ Less
Submitted 23 March, 2017;
originally announced March 2017.
-
Randomizing growing networks with a time-respecting null model
Authors:
Zhuo-Ming Ren,
Manuel Sebastian Mariani,
Yi-Cheng Zhang,
Matus Medo
Abstract:
Complex networks are often used to represent systems that are not static but grow with time: people make new friendships, new papers are published and refer to the existing ones, and so forth. To assess the statistical significance of measurements made on such networks, we propose a randomization methodology---a time-respecting null model---that preserves both the network's degree sequence and the…
▽ More
Complex networks are often used to represent systems that are not static but grow with time: people make new friendships, new papers are published and refer to the existing ones, and so forth. To assess the statistical significance of measurements made on such networks, we propose a randomization methodology---a time-respecting null model---that preserves both the network's degree sequence and the time evolution of individual nodes' degree values. By preserving the temporal linking patterns of the analyzed system, the proposed model is able to factor out the effect of the system's temporal patterns on its structure. We apply the model to the citation network of Physical Review scholarly papers and the citation network of US movies. The model reveals that the two datasets are strikingly different with respect to their degree-degree correlations, and we discuss the important implications of this finding on the information provided by paradigmatic node centrality metrics such as indegree and Google's PageRank. The randomization methodology proposed here can be used to assess the significance of any structural property in growing networks, which could bring new insights into the problems where null models play a critical role, such as the detection of communities and network motifs.
△ Less
Submitted 16 November, 2017; v1 submitted 22 March, 2017;
originally announced March 2017.
-
Identification of milestone papers through time-balanced network centrality
Authors:
Manuel Sebastian Mariani,
Matus Medo,
Yi-Cheng Zhang
Abstract:
Citations between scientific papers and related bibliometric indices, such as the $h$-index for authors and the impact factor for journals, are being increasingly used - often in controversial ways - as quantitative tools for research evaluation. Yet, a fundamental research question remains still open: to which extent do quantitative metrics capture the significance of scientific works? We analyze…
▽ More
Citations between scientific papers and related bibliometric indices, such as the $h$-index for authors and the impact factor for journals, are being increasingly used - often in controversial ways - as quantitative tools for research evaluation. Yet, a fundamental research question remains still open: to which extent do quantitative metrics capture the significance of scientific works? We analyze the network of citations among the $449,935$ papers published by the American Physical Society (APS) journals between 1893 and 2009, and focus on the comparison of metrics built on the citation count with network-based metrics. We contrast five article-level metrics with respect to the rankings that they assign to a set of fundamental papers, called Milestone Letters, carefully selected by the APS editors for "making long-lived contributions to physics, either by announcing significant discoveries, or by initiating new areas of research". A new metric, which combines PageRank centrality with the explicit requirement that paper score is not biased by paper age, is the best-performing metric overall in identifying the Milestone Letters. The lack of time bias in the new metric makes it also possible to use it to compare papers of different age on the same scale. We find that network-based metrics identify the Milestone Letters better than metrics based on the citation count, which suggests that the structure of the citation network contains information that can be used to improve the ranking of scientific publications. The methods and results presented here are relevant for all evolving systems where network centrality metrics are applied, for example the World Wide Web and online social networks. An interactive Web platform where it is possible to view the ranking of the APS papers by rescaled PageRank is available at the address \url{http://www.sciencenow.info}.
△ Less
Submitted 8 November, 2016; v1 submitted 30 August, 2016;
originally announced August 2016.
-
Identification and modeling of discoverers in online social systems
Authors:
Matus Medo,
Manuel S. Mariani,
An Zeng,
Yi-Cheng Zhang
Abstract:
The dynamics of individuals is of essential importance for understanding the evolution of social systems. Most existing models assume that individuals in diverse systems, ranging from social networks to e-commerce, all tend to what is already popular. We develop an analytical time-aware framework which shows that when individuals make choices -- which item to buy, for example -- in online social s…
▽ More
The dynamics of individuals is of essential importance for understanding the evolution of social systems. Most existing models assume that individuals in diverse systems, ranging from social networks to e-commerce, all tend to what is already popular. We develop an analytical time-aware framework which shows that when individuals make choices -- which item to buy, for example -- in online social systems, a small fraction of them is consistently successful in discovering popular items long before they actually become popular. We argue that these users, whom we refer to as discoverers, are fundamentally different from the previously known opinion leaders, influentials, and innovators. We use the proposed framework to demonstrate that discoverers are present in a wide range of systems. Once identified, they can be used to predict the future success of items. We propose a network model which reproduces the discovery patterns observed in the real data. Furthermore, data produced by the model pose a fundamental challenge to classical ranking algorithms which neglect the time of link creation and thus fail to discriminate between discoverers and ordinary users in the data. Our results open the door to qualitative and quantitative study of fine temporal patterns in social systems and have far-reaching implications for network modeling and algorithm design.
△ Less
Submitted 4 September, 2015;
originally announced September 2015.
-
Ranking nodes in growing networks: When PageRank fails
Authors:
Manuel Sebastian Mariani,
Matus Medo,
Yi-Cheng Zhang
Abstract:
PageRank is arguably the most popular ranking algorithm which is being applied in real systems ranging from information to biological and infrastructure networks. Despite its outstanding popularity and broad use in different areas of science, the relation between the algorithm's efficacy and properties of the network on which it acts has not yet been fully understood. We study here PageRank's perf…
▽ More
PageRank is arguably the most popular ranking algorithm which is being applied in real systems ranging from information to biological and infrastructure networks. Despite its outstanding popularity and broad use in different areas of science, the relation between the algorithm's efficacy and properties of the network on which it acts has not yet been fully understood. We study here PageRank's performance on a network model supported by real data, and show that realistic temporal effects make PageRank fail in individuating the most valuable nodes for a broad range of model parameters. Results on real data are in qualitative agreement with our model-based findings. This failure of PageRank reveals that the static approach to information filtering is inappropriate for a broad class of growing systems, and suggest that time-dependent algorithms that are based on the temporal linking patterns of these systems are needed to better rank the nodes.
△ Less
Submitted 3 September, 2015;
originally announced September 2015.