Skip to main content

Showing 1–42 of 42 results for author: Clauset, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.03127  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Fast algorithms to improve fair information access in networks

    Authors: Dennis Robert Windham, Caroline J. Wendt, Alex Crane, Madelyn J Warr, Freda Shi, Sorelle A. Friedler, Blair D. Sullivan, Aaron Clauset

    Abstract: We consider the problem of selecting $k$ seed nodes in a network to maximize the minimum probability of activation under an independent cascade beginning at these seeds. The motivation is to promote fairness by ensuring that even the least advantaged members of the network have good access to information. Our problem can be viewed as a variant of the classic influence maximization objective, but i… ▽ More

    Submitted 19 February, 2025; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: 9 pages, 9 figures, and 3 appendices containing 12 algorithms, 1 table, and 9 additional figures

  2. Link Prediction Accuracy on Real-World Networks Under Non-Uniform Missing Edge Patterns

    Authors: Xie He, Amir Ghasemian, Eun Lee, Alice Schwarze, Aaron Clauset, Peter J. Mucha

    Abstract: Real-world network datasets are typically obtained in ways that fail to capture all edges. The patterns of missing data are often non-uniform as they reflect biases and other shortcomings of different data collection methods. Nevertheless, uniform missing data is a common assumption made when no additional information is available about the underlying missing-edge pattern, and link prediction meth… ▽ More

    Submitted 30 April, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Submitted to PLOS ONE

    Journal ref: PLoS ONE 19(7): e0306883 (2024)

  3. arXiv:2309.04414  [pdf, other

    stat.AP cs.DL

    Scientific productivity as a random walk

    Authors: Sam Zhang, Nicholas LaBerge, Samuel F. Way, Daniel B. Larremore, Aaron Clauset

    Abstract: The expectation that scientific productivity follows regular patterns over a career underpins many scholarly evaluations, including hiring, promotion and tenure, awards, and grant funding. However, recent studies of individual productivity patterns reveal a puzzle: on the one hand, the average number of papers published per year robustly follows the "canonical trajectory" of a rapid rise to an ear… ▽ More

    Submitted 13 March, 2025; v1 submitted 8 September, 2023; originally announced September 2023.

    MSC Class: 62P25

  4. arXiv:2208.01714  [pdf, other

    cs.SI

    An Open-Source Cultural Consensus Approach to Name-Based Gender Classification

    Authors: Ian Van Buskirk, Aaron Clauset, Daniel B. Larremore

    Abstract: Name-based gender classification has enabled hundreds of otherwise infeasible scientific studies of gender. Yet, the lack of standardization, proliferation of ad hoc methods, reliance on paid services, understudied limitations, and conceptual debates cast a shadow over many applications. To address these problems we develop and evaluate an ensemble-based open-source method built on publicly availa… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  5. arXiv:2204.05989  [pdf, other

    cs.DL

    Labor advantages drive the greater productivity of faculty at elite universities

    Authors: Sam Zhang, K. Hunter Wapman, Daniel B. Larremore, Aaron Clauset

    Abstract: Faculty at prestigious institutions dominate scientific discourse, with the small proportion of researchers at elite universities producing a disproportionate share of all research publications. Environmental prestige is known to drive such epistemic disparity, but the mechanisms by which it causes increased faculty productivity remain unknown. Here we combine employment, publication, and federal… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 22 pages, 11 figures

    ACM Class: J.4; K.4

  6. arXiv:2201.00254  [pdf, other

    cs.CY

    Subfield prestige and gender inequality in computing

    Authors: Nicholas LaBerge, K. Hunter Wapman, Allison C. Morgan, Sam Zhang, Daniel B. Larremore, Aaron Clauset

    Abstract: Women and people of color remain dramatically underrepresented among computing faculty, and improvements in demographic diversity are slow and uneven. Effective diversification strategies depend on quantifying the correlates, causes, and trends of diversity in the field. But field-level demographic changes are driven by subfield hiring dynamics because faculty searches are typically at the subfiel… ▽ More

    Submitted 9 May, 2022; v1 submitted 1 January, 2022; originally announced January 2022.

    Comments: 20 pages, 12 figures, 5 tables

  7. arXiv:2105.12120  [pdf, other

    cs.SI physics.data-an stat.ME

    Sampling random graphs with specified degree sequences

    Authors: Upasana Dutta, Bailey K. Fosdick, Aaron Clauset

    Abstract: The configuration model is a standard tool for uniformly generating random graphs with a specified degree sequence, and is often used as a null model to evaluate how much of an observed network's structure can be explained by its degree structure alone. A Markov chain Monte Carlo (MCMC) algorithm, based on a degree-preserving double-edge swap, provides an asymptotic solution to sample from the con… ▽ More

    Submitted 29 May, 2023; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Same as version v3 but with corrected white spaces between paragraphs

  8. arXiv:2105.02949  [pdf, other

    physics.soc-ph cs.CY cs.SI

    The Dynamics of Faculty Hiring Networks

    Authors: Eun Lee, Aaron Clauset, Daniel B. Larremore

    Abstract: Faculty hiring networks-who hires whose graduates as faculty-exhibit steep hierarchies, which can reinforce both social and epistemic inequalities in academia. Understanding the mechanisms driving these patterns would inform efforts to diversify the academy and shed new light on the role of hiring in shaping which scientific discoveries are made. Here, we investigate the degree to which structural… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Journal ref: EPJ Data Science 10, 48 (2021)

  9. arXiv:2011.12843  [pdf, other

    cs.SI cs.CY cs.IR

    Examining the consumption of radical content on YouTube

    Authors: Homa Hosseinmardi, Amir Ghasemian, Aaron Clauset, Markus Mobius, David M. Rothschild, Duncan J. Watts

    Abstract: Although it is under-studied relative to other social media platforms, YouTube is arguably the largest and most engaging online media consumption platform in the world. Recently, YouTube's scale has fueled concerns that YouTube users are being radicalized via a combination of biased recommendations and ostensibly apolitical anti-woke channels, both of which have been claimed to direct attention to… ▽ More

    Submitted 14 February, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 10 pages, 7 figures, 2 tables

  10. arXiv:1909.07578  [pdf, other

    stat.ML cs.LG cs.SI physics.data-an q-bio.MN

    Stacking Models for Nearly Optimal Link Prediction in Complex Networks

    Authors: Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, Aaron Clauset

    Abstract: Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speedup the collection of network data and improve the validity of network models. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability v… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 30 pages, 9 figures, 22 tables

    Journal ref: Proc. Natl. Acad. Sci. USA 117(38), 23393-23400 (2020)

  11. arXiv:1904.04948  [pdf, other

    cs.SI cs.CY

    Environmental Changes and the Dynamics of Musical Identity

    Authors: Samuel F. Way, Santiago Gil, Ian Anderson, Aaron Clauset

    Abstract: Musical tastes reflect our unique values and experiences, our relationships with others, and the places where we live. But as each of these things changes, do our tastes also change to reflect the present, or remain fixed, reflecting our past? Here, we investigate how where a person lives shapes their musical preferences, using geographic relocation to construct quasi-natural experiments that meas… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: Accepted to be published at ICWSM'19

  12. arXiv:1810.08988  [pdf, other

    cs.CY physics.soc-ph

    Predicting the outcomes of policy diffusion from U.S. states to federal law

    Authors: Nora Connor, Aaron Clauset

    Abstract: In the United States, national policies often begin as state laws, which then spread from state to state until they gain momentum to become enacted as a national policy. However, not every state policy reaches the national level. Previous work has suggested that state-level policies are more likely to become national policies depending on their geographic origin, their category of legislation, or… ▽ More

    Submitted 21 October, 2018; originally announced October 2018.

  13. arXiv:1806.07005  [pdf, ps, other

    physics.soc-ph cond-mat.dis-nn cs.SI physics.data-an

    Thermodynamics of the Minimum Description Length on Community Detection

    Authors: Juan Ignacio Perotti, Claudio Juan Tessone, Aaron Clauset, Guido Caldarelli

    Abstract: Modern statistical modeling is an important complement to the more traditional approach of physics where Complex Systems are studied by means of extremely simple idealized models. The Minimum Description Length (MDL) is a principled approach to statistical modeling combining Occam's razor with Information Theory for the selection of models providing the most concise descriptions. In this work, we… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 13 pages and 4 figures

  14. arXiv:1805.09966  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Prestige drives epistemic inequality in the diffusion of scientific ideas

    Authors: Allison C. Morgan, Dimitrios J. Economou, Samuel F. Way, Aaron Clauset

    Abstract: The spread of ideas in the scientific community is often viewed as a competition, in which good ideas spread further because of greater intrinsic fitness, and publication venue and citation counts correlate with importance and impact. However, relatively little is known about how structural factors influence the spread of ideas, and specifically how where an idea originates might influence how it… ▽ More

    Submitted 22 October, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: 10 pages, 8 figures, 1 table

    Journal ref: EPJ Data Science 7, 40 (2018)

  15. arXiv:1804.02760  [pdf, other

    cs.DL cs.CY cs.SI physics.soc-ph

    Automatically assembling a full census of an academic field

    Authors: Allison C. Morgan, Samuel F. Way, Aaron Clauset

    Abstract: The composition of the scientific workforce shapes the direction of scientific research, directly through the selection of questions to investigate, and indirectly through its influence on the training of future scientists. In most fields, however, complete census information is difficult to obtain, complicating efforts to study workforce dynamics and the effects of policy. This is particularly tr… ▽ More

    Submitted 26 April, 2018; v1 submitted 8 April, 2018; originally announced April 2018.

    Comments: 11 pages, 6 figures, 2 tables

    Journal ref: PLoS ONE 13(8), e0202223 (2018)

  16. arXiv:1802.10582  [pdf, other

    stat.ML cs.SI physics.data-an q-bio.MN

    Evaluating Overfit and Underfit in Models of Network Community Structure

    Authors: Amir Ghasemian, Homa Hosseinmardi, Aaron Clauset

    Abstract: A common data mining task on networks is community detection, which seeks an unsupervised decomposition of a network into structural groups based on statistical regularities in the network's connectivity. Although many methods exist, the No Free Lunch theorem for community detection implies that each makes some kind of tradeoff, and no algorithm can be optimal on all inputs. Thus, different algori… ▽ More

    Submitted 16 April, 2019; v1 submitted 28 February, 2018; originally announced February 2018.

    Comments: 22 pages, 13 figures, 3 tables

    Journal ref: IEEE Trans. Knowledge and Data Engineering 32(9), 1722-1735 (2019)

  17. arXiv:1801.03400  [pdf, other

    physics.soc-ph cs.SI physics.data-an q-bio.MN stat.AP

    Scale-free networks are rare

    Authors: Anna D. Broido, Aaron Clauset

    Abstract: A central claim in modern network science is that real-world networks are typically "scale free," meaning that the fraction of nodes with degree $k$ follows a power law, decaying like $k^{-α}$, often with $2 < α< 3$. However, empirical evidence for this belief derives from a relatively small number of real-world networks. We test the universality of scale-free structure by applying state-of-the-ar… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

    Comments: 14 pages, 9 figures, 2 tables, 5 appendices

    Journal ref: Nature Communications 10, 1017 (2019)

  18. arXiv:1710.11304  [pdf, other

    cs.SI physics.data-an q-bio.MN stat.ML

    Characterizing the structural diversity of complex networks across domains

    Authors: Kansuke Ikehara, Aaron Clauset

    Abstract: The structure of complex networks has been of interest in many scientific and engineering disciplines over the decades. A number of studies in the field have been focused on finding the common properties among different kinds of networks such as heavy-tail degree distribution, small-worldness and modular structure and they have tried to establish a theory of structural universality in complex netw… ▽ More

    Submitted 30 October, 2017; originally announced October 2017.

    Comments: 23 pages, 11 figures, 2 tables; originally published as K. Ikehara, "The Structure of Complex Networks across Domains." MS Thesis, University of Colorado Boulder (2016)

  19. arXiv:1612.08228  [pdf, other

    cs.DL physics.soc-ph

    The misleading narrative of the canonical faculty productivity trajectory

    Authors: Samuel F. Way, Allison C. Morgan, Aaron Clauset, Daniel B. Larremore

    Abstract: A scientist may publish tens or hundreds of papers over a career, but these contributions are not evenly spaced in time. Sixty years of studies on career productivity patterns in a variety of fields suggest an intuitive and universal pattern: productivity tends to rise rapidly to an early peak and then gradually declines. Here, we test the universality of this conventional narrative by analyzing t… ▽ More

    Submitted 17 October, 2017; v1 submitted 24 December, 2016; originally announced December 2016.

    Comments: 18 pages, 16 figures

    Journal ref: Proceedings of the National Academy of Sciences 114.44 (2017): E9216-E9223

  20. arXiv:1608.05878  [pdf, other

    cs.SI physics.data-an physics.soc-ph stat.ML

    The ground truth about metadata and community detection in networks

    Authors: Leto Peel, Daniel B. Larremore, Aaron Clauset

    Abstract: Across many scientific domains, there is a common need to automatically extract a simplified view or coarse-graining of how a complex system's components interact. This general task is called community detection in networks and is analogous to searching for clusters in independent vector data. It is common to evaluate the performance of community detection algorithms by their ability to find so-ca… ▽ More

    Submitted 3 May, 2017; v1 submitted 20 August, 2016; originally announced August 2016.

    Comments: 27 pages, 10 figures, 11 tables

    Journal ref: Science Advances 3(5) e1602548, 2017

  21. arXiv:1602.00795  [pdf, other

    cs.SI cs.CY physics.soc-ph stat.AP

    Gender, Productivity, and Prestige in Computer Science Faculty Hiring Networks

    Authors: Samuel F. Way, Daniel B. Larremore, Aaron Clauset

    Abstract: Women are dramatically underrepresented in computer science at all levels in academia and account for just 15% of tenure-track faculty. Understanding the causes of this gender imbalance would inform both policies intended to rectify it and employment decisions by departments and individuals. Progress in this direction, however, is complicated by the complexity and decentralized nature of faculty h… ▽ More

    Submitted 2 February, 2016; originally announced February 2016.

    Comments: 11 pages, 7 figures, 5 tables

    Journal ref: Proc. 2016 World Wide Web Conference (WWW), 1169-1179 (2016)

  22. arXiv:1507.04001  [pdf, ps, other

    cs.SI physics.data-an physics.soc-ph stat.ML

    Structure and inference in annotated networks

    Authors: M. E. J. Newman, Aaron Clauset

    Abstract: For many networks of scientific interest we know both the connections of the network and information about the network nodes, such as the age or gender of individuals in a social network, geographic location of nodes in the Internet, or cellular function of nodes in a gene regulatory network. Here we demonstrate how this "metadata" can be used to improve our analysis and understanding of network s… ▽ More

    Submitted 14 July, 2015; originally announced July 2015.

    Comments: 16 pages, 7 figures, 1 table

    Journal ref: Nature Communications 7, 11863 (2016)

  23. arXiv:1507.01266  [pdf, other

    physics.soc-ph cs.SI nlin.AO physics.data-an

    Eigenvector-Based Centrality Measures for Temporal Networks

    Authors: Dane Taylor, Sean A. Myers, Aaron Clauset, Mason A. Porter, Peter J. Mucha

    Abstract: Numerous centrality measures have been developed to quantify the importances of nodes in time-independent networks, and many of them can be expressed as the leading eigenvector of some matrix. With the increasing availability of network data that changes in time, it is important to extend such eigenvector-based centrality measures to time-dependent networks. In this paper, we introduce a principle… ▽ More

    Submitted 21 September, 2016; v1 submitted 5 July, 2015; originally announced July 2015.

    Comments: 38 pages, 7 figures, and 5 tables

  24. arXiv:1506.06179  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG cs.SI physics.data-an

    Detectability thresholds and optimal algorithms for community structure in dynamic networks

    Authors: Amir Ghasemian, Pan Zhang, Aaron Clauset, Cristopher Moore, Leto Peel

    Abstract: We study the fundamental limits on learning latent community structure in dynamic networks. Specifically, we study dynamic stochastic block models where nodes change their community membership over time, but where edges are generated independently at each time step. In this setting (which is a special case of several existing models), we are able to derive the detectability threshold exactly, as a… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.

    Comments: 9 pages, 3 figures

    Journal ref: Phys. Rev. X 6, 031005 (2016)

  25. arXiv:1504.05872  [pdf, other

    physics.data-an cs.CY stat.AP

    Predicting sports scoring dynamics with restoration and anti-persistence

    Authors: Leto Peel, Aaron Clauset

    Abstract: Professional team sports provide an excellent domain for studying the dynamics of social competitions. These games are constructed with simple, well-defined rules and payoffs that admit a high-dimensional set of possible actions and nontrivial scoring dynamics. The resulting gameplay and efforts to predict its evolution are the object of great interest to both sports professionals and enthusiasts.… ▽ More

    Submitted 22 April, 2015; originally announced April 2015.

    Comments: 12 pages, 9 figures

    Journal ref: Proc. 2015 IEEE International Conference on Data Mining (ICDM), 339-348 2015

  26. arXiv:1503.06772  [pdf, other

    cs.SI physics.soc-ph

    Assembling thefacebook: Using heterogeneity to understand online social network assembly

    Authors: Abigail Z. Jacobs, Samuel F. Way, Johan Ugander, Aaron Clauset

    Abstract: Online social networks represent a popular and diverse class of social media systems. Despite this variety, each of these systems undergoes a general process of online social network assembly, which represents the complicated and heterogeneous changes that transform newly born systems into mature platforms. However, little is known about this process. For example, how much of a network's assembly… ▽ More

    Submitted 31 May, 2015; v1 submitted 23 March, 2015; originally announced March 2015.

    Comments: 13 pages, 11 figures, Proceedings of the 7th Annual ACM Web Science Conference (WebSci), 2015

  27. arXiv:1411.4070  [pdf, other

    stat.ML cs.LG cs.SI physics.soc-ph

    A unified view of generative models for networks: models, methods, opportunities, and challenges

    Authors: Abigail Z. Jacobs, Aaron Clauset

    Abstract: Research on probabilistic models of networks now spans a wide variety of fields, including physics, sociology, biology, statistics, and machine learning. These efforts have produced a diverse ecology of models and methods. Despite this diversity, many of these models share a common underlying structure: pairwise interactions (edges) are generated with probability conditional on latent vertex attri… ▽ More

    Submitted 14 November, 2014; originally announced November 2014.

    Comments: 10 pages. To appear at the NIPS 2014 Workshop on Networks: From Graphs to Rich Data

  28. arXiv:1404.0431  [pdf, other

    stat.ML cs.SI physics.data-an physics.soc-ph

    Learning Latent Block Structure in Weighted Networks

    Authors: Christopher Aicher, Abigail Z. Jacobs, Aaron Clauset

    Abstract: Community detection is an important task in network analysis, in which we aim to learn a network partition that groups together vertices with similar community-level connectivity patterns. By finding such groups of vertices with similar structural roles, we extract a compact representation of the network's large-scale structure, which can facilitate its scientific interpretation and the prediction… ▽ More

    Submitted 3 June, 2014; v1 submitted 1 April, 2014; originally announced April 2014.

    Comments: 28 Pages

    Journal ref: Journal of Complex Networks (2015) 3 (2): 221-248

  29. arXiv:1403.2933  [pdf, other

    cs.SI physics.data-an physics.soc-ph q-bio.QM stat.ML

    Efficiently inferring community structure in bipartite networks

    Authors: Daniel B. Larremore, Aaron Clauset, Abigail Z. Jacobs

    Abstract: Bipartite networks are a common type of network data in which there are two types of vertices, and only vertices of different types can be connected. While bipartite networks exhibit community structure like their unipartite counterparts, existing approaches to bipartite community detection have drawbacks, including implicit parameter choices, loss of information through one-mode projections, and… ▽ More

    Submitted 10 July, 2014; v1 submitted 12 March, 2014; originally announced March 2014.

    Comments: 12 pages, 9 figures

    Journal ref: Physical Review E 90(1): 012805 (2014)

  30. arXiv:1403.0989  [pdf, other

    cs.SI physics.soc-ph stat.ML

    Detecting change points in the large-scale structure of evolving networks

    Authors: Leto Peel, Aaron Clauset

    Abstract: Interactions among people or objects are often dynamic in nature and can be represented as a sequence of networks, each providing a snapshot of the interactions over a brief period of time. An important task in analyzing such evolving networks is change-point detection, in which we both identify the times at which the large-scale pattern of interactions changes fundamentally and quantify how large… ▽ More

    Submitted 14 November, 2014; v1 submitted 4 March, 2014; originally announced March 2014.

    Journal ref: Proc. of the 29th International Conference on Artificial Intelligence (AAAI), 2914-2920 (2015)

  31. arXiv:1310.4461  [pdf, other

    stat.AP cs.CY physics.data-an physics.soc-ph

    Scoring dynamics across professional team sports: tempo, balance and predictability

    Authors: Sears Merritt, Aaron Clauset

    Abstract: Despite growing interest in quantifying and modeling the scoring dynamics within professional sports games, relative little is known about what patterns or principles, if any, cut across different sports. Using a comprehensive data set of scoring events in nearly a dozen consecutive seasons of college and professional (American) football, professional hockey, and professional basketball, we identi… ▽ More

    Submitted 20 March, 2014; v1 submitted 16 October, 2013; originally announced October 2013.

    Comments: 18 pages, 8 figures, 4 tables, 2 appendices

    Journal ref: EPJ Data Science 3, 4 (2014)

  32. arXiv:1306.4363  [pdf, ps, other

    cs.SI physics.data-an physics.soc-ph

    Social Network Dynamics in a Massive Online Game: Network Turnover, Non-densification, and Team Engagement in Halo Reach

    Authors: Sears Merritt, Aaron Clauset

    Abstract: Online multiplayer games are a popular form of social interaction, used by hundreds of millions of individuals. However, little is known about the social networks within these online games, or how they evolve over time. Understanding human social dynamics within massive online games can shed new light on social interactions in general and inform the development of more engaging systems. Here, we s… ▽ More

    Submitted 18 June, 2013; originally announced June 2013.

    Comments: 8 pages, 13 figures

    ACM Class: H.2.8

  33. arXiv:1305.5782  [pdf, ps, other

    stat.ML cs.LG cs.SI physics.data-an

    Adapting the Stochastic Block Model to Edge-Weighted Networks

    Authors: Christopher Aicher, Abigail Z. Jacobs, Aaron Clauset

    Abstract: We generalize the stochastic block model to the important case in which edges are annotated with weights drawn from an exponential family distribution. This generalization introduces several technical difficulties for model estimation, which we solve using a Bayesian approach. We introduce a variational algorithm that efficiently approximates the model's posterior distribution for dense graphs. In… ▽ More

    Submitted 24 May, 2013; originally announced May 2013.

  34. arXiv:1304.1039  [pdf, ps, other

    physics.soc-ph cs.SI physics.data-an stat.AP

    Environmental structure and competitive scoring advantages in team competitions

    Authors: Sears Merritt, Aaron Clauset

    Abstract: In most professional sports, the structure of the environment is kept neutral so that scoring imbalances may be attributed to differences in team skill. It thus remains unknown what impact structural heterogeneities can have on scoring dynamics and producing competitive advantages. Applying a generative model of scoring dynamics to roughly 10 million team competitions drawn from an online game, we… ▽ More

    Submitted 3 April, 2013; originally announced April 2013.

    Comments: Main Text: 8 pages, 4 figures, 2 tables; Supplementary Information: 12 pages, 13 figures, 9 tables

    Journal ref: Scientific Reports 3, 3067 (2013)

  35. arXiv:1303.6372  [pdf, ps, other

    cs.SI cs.CY cs.HC physics.soc-ph

    Detecting Friendship Within Dynamic Online Interaction Networks

    Authors: Sears Merritt, Abigail Z. Jacobs, Winter Mason, Aaron Clauset

    Abstract: In many complex social systems, the timing and frequency of interactions between individuals are observable but friendship ties are hidden. Recovering these hidden ties, particularly for casual users who are relatively less active, would enable a wide variety of friendship-aware applications in domains where labeled data are often unavailable, including online advertising and national security. He… ▽ More

    Submitted 25 March, 2013; originally announced March 2013.

    Comments: To Appear at the 7th International AAAI Conference on Weblogs and Social Media (ICWSM '13), 11 pages, 1 table, 6 figures

    Journal ref: Proc. of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM), 380 - 389 (2013)

  36. arXiv:1211.7343  [pdf, ps, other

    physics.data-an cs.SI physics.soc-ph

    Persistence and periodicity in a dynamic proximity network

    Authors: Aaron Clauset, Nathan Eagle

    Abstract: The topology of social networks can be understood as being inherently dynamic, with edges having a distinct position in time. Most characterizations of dynamic networks discretize time by converting temporal information into a sequence of network "snapshots" for further analysis. Here we study a highly resolved data set of a dynamic proximity network of 66 individuals. We show that the topology of… ▽ More

    Submitted 30 November, 2012; originally announced November 2012.

    Comments: 5 pages, 6 figures, part of the Reality Mining Project at http://realitycommons.media.mit.edu/ . Originally published in 2007; Proceedings of the DIMACS Workshop on Computational Methods for Dynamic Interaction Networks (Piscataway), 2007

  37. arXiv:1209.0089  [pdf, ps, other

    physics.data-an cs.LG physics.soc-ph stat.AP stat.ME

    Estimating the historical and future probabilities of large terrorist events

    Authors: Aaron Clauset, Ryan Woodard

    Abstract: Quantities with right-skewed distributions are ubiquitous in complex social systems, including political conflict, economics and social networks, and these systems sometimes produce extremely large events. For instance, the 9/11 terrorist events produced nearly 3000 fatalities, nearly six times more than the next largest event. But, was this enormous loss of life statistically unlikely given moder… ▽ More

    Submitted 8 January, 2014; v1 submitted 1 September, 2012; originally announced September 2012.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOAS614 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS614

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 4, 1838-1865

  38. arXiv:1203.2268  [pdf, other

    cs.SI cs.CY cs.HC physics.soc-ph

    Friends FTW! Friendship, Collaboration and Competition in Halo: Reach

    Authors: Winter Mason, Aaron Clauset

    Abstract: How important are friendships in determining success by individuals and teams in complex collaborative environments? By combining a novel data set containing the dynamics of millions of ad hoc teams from the popular multiplayer online first person shooter Halo: Reach with survey data on player demographics, play style, psychometrics and friendships derived from an anonymous online survey, we inves… ▽ More

    Submitted 25 February, 2013; v1 submitted 10 March, 2012; originally announced March 2012.

    Comments: 12 pages, 12 figures, 4 tables

    Journal ref: Proceedings of the 2013 Conference on Computer Supported Cooperative Work (CSCW '13), 375-386 (2013)

  39. arXiv:1103.0949  [pdf, other

    stat.ML cs.LG physics.data-an stat.ME

    Adapting to Non-stationarity with Growing Expert Ensembles

    Authors: Cosma Rohilla Shalizi, Abigail Z. Jacobs, Kristina Lisa Klinkner, Aaron Clauset

    Abstract: When dealing with time series with complex non-stationarities, low retrospective regret on individual realizations is a more appropriate goal than low prospective risk in expectation. Online learning algorithms provide powerful guarantees of this form, and have often been proposed for use with non-stationary processes because of their ability to switch between different forecasters or ``experts''.… ▽ More

    Submitted 28 June, 2011; v1 submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, 1 figure; CMU Statistics Technical Report. v2: Added empirical example, revised discussion of related work

  40. arXiv:physics/0610051  [pdf, ps, other

    physics.soc-ph cs.LG physics.data-an

    Structural Inference of Hierarchies in Networks

    Authors: Aaron Clauset, Cristopher Moore, M. E. J. Newman

    Abstract: One property of networks that has received comparatively little attention is hierarchy, i.e., the property of having vertices that cluster together in groups, which then join to form groups of groups, and so forth, up through all levels of organization in the network. Here, we give a precise definition of hierarchical structure, give a generic model for generating arbitrary hierarchical structur… ▽ More

    Submitted 9 October, 2006; originally announced October 2006.

    Comments: 8 pages, 8 figures

    Journal ref: Proc. 23rd International Conference on Machine Learning (ICML), Workshop on Social Network Analysis, Pittsburgh PA, June 2006

  41. arXiv:cond-mat/0503087  [pdf, ps, other

    cond-mat.dis-nn cs.NI math.CO math.PR

    On the Bias of Traceroute Sampling; or, Power-law Degree Distributions in Regular Graphs

    Authors: Dimitris Achlioptas, Aaron Clauset, David Kempe, Cristopher Moore

    Abstract: Understanding the structure of the Internet graph is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining its graph structure is a surprisingly difficult task, as edges cannot be explicitly queried. Instead, empirical studies rely on traceroutes to build what are essentially single-source, all-destinations, shortest-path… ▽ More

    Submitted 29 March, 2006; v1 submitted 3 March, 2005; originally announced March 2005.

    Comments: Long-format version (19 pages); includes small correction to section 6.1

    Journal ref: Proc. 37th ACM Symposium on Theory of Computing (STOC) 2005

  42. arXiv:cond-mat/0410059  [pdf, ps, other

    cond-mat.dis-nn cs.NI physics.soc-ph

    Accuracy and Scaling Phenomena in Internet Mapping

    Authors: Aaron Clauset, Cristopher Moore

    Abstract: A great deal of effort has been spent measuring topological features of the Internet. However, it was recently argued that sampling based on taking paths or traceroutes through the network from a small number of sources introduces a fundamental bias in the observed degree distribution. We examine this bias analytically and experimentally. For Erdos-Renyi random graphs with mean degree c, we show… ▽ More

    Submitted 4 October, 2004; originally announced October 2004.

    Comments: 4 pages, 3 figures; supercedes cond-mat/0407339 and contains scaling results on the accuracy of multi-source traceroute studies

    Journal ref: Phys. Rev. Lett. 94, 018701 (2005)