Skip to main content

Showing 1–26 of 26 results for author: Peixoto, T P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.07736  [pdf, other

    stat.ML cs.LG cs.SI physics.data-an physics.soc-ph

    Uncertainty quantification and posterior sampling for network reconstruction

    Authors: Tiago P. Peixoto

    Abstract: Network reconstruction is the task of inferring the unseen interactions between elements of a system, based only on their behavior or dynamics. This inverse problem is in general ill-posed, and admits many solutions for the same observation. Nevertheless, the vast majority of statistical methods proposed for this task -- formulated as the inference of a graphical generative model -- can only produ… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 16 pages, 12 figures. Code available in https://graph-tool.skewed.de

  2. arXiv:2405.01015  [pdf, other

    stat.ML cs.LG cs.SI physics.data-an q-bio.PE

    Network reconstruction via the minimum description length principle

    Authors: Tiago P. Peixoto

    Abstract: A fundamental problem associated with the task of network reconstruction from dynamical or behavioral data consists in determining the most appropriate model complexity in a manner that prevents overfitting, and produces an inferred network with a statistically justifiable number of edges. The status quo in this context is based on $L_{1}$ regularization combined with cross-validation. However, be… ▽ More

    Submitted 21 March, 2025; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 18 pages, 9 figures. Code and documentation are available at https://graph-tool.skewed.de/static/doc/demos/reconstruction_indirect/reconstruction.html

    Journal ref: Phys. Rev. X 15, 011065 (2025)

  3. arXiv:2401.01404  [pdf, other

    cs.DS cs.LG physics.data-an stat.CO stat.ML

    Scalable network reconstruction in subquadratic time

    Authors: Tiago P. Peixoto

    Abstract: Network reconstruction consists in determining the unobserved pairwise couplings between $N$ nodes given only observational data on the resulting behavior that is conditioned on those couplings -- typically a time-series or independent samples from a graphical model. A major obstacle to the scalability of algorithms proposed for this problem is a seemingly unavoidable quadratic complexity of… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures. Code and documentation available at https://graph-tool.skewed.de/static/doc/demos/reconstruction_indirect/reconstruction.html

  4. arXiv:2210.09186  [pdf, other

    cs.SI cs.LG physics.data-an physics.soc-ph stat.ML

    Implicit models, latent compression, intrinsic biases, and cheap lunches in community detection

    Authors: Tiago P. Peixoto, Alec Kirkley

    Abstract: The task of community detection, which aims to partition a network into clusters of nodes to summarize its large-scale structure, has spawned the development of many competing algorithms with varying objectives. Some community detection methods are inferential, explicitly deriving the clustering objective through a probabilistic generative model, while other methods are descriptive, dividing a net… ▽ More

    Submitted 7 November, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 28 pages, 18 figures

    Journal ref: Phys. Rev. E 108, 024309 (2023)

  5. arXiv:2203.16460  [pdf, other

    cs.SI physics.data-an physics.soc-ph stat.ME

    Ordered community detection in directed networks

    Authors: Tiago P. Peixoto

    Abstract: We develop a method to infer community structure in directed networks where the groups are ordered in a latent one-dimensional hierarchy that determines the preferred edge direction. Our nonparametric Bayesian approach is based on a modification of the stochastic block model (SBM), which can take advantage of rank alignment and coherence to produce parsimonious descriptions of networks that combin… ▽ More

    Submitted 31 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 20 pages, 8 figures, 1 table

    Journal ref: Phys. Rev. E 106, 024305 (2022)

  6. arXiv:2201.01658  [pdf, other

    physics.soc-ph physics.data-an stat.AP stat.ML

    Systematic assessment of the quality of fit of the stochastic block model for empirical networks

    Authors: Felipe Vaca-Ramírez, Tiago P. Peixoto

    Abstract: We perform a systematic analysis of the quality of fit of the stochastic block model (SBM) for 275 empirical networks spanning a wide range of domains and orders of size magnitude. We employ posterior predictive model checking as a criterion to assess the quality of fit, which involves comparing networks generated by the inferred model with the empirical network, according to a set of network desc… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 21 pages, 9 figures

  7. arXiv:2112.00183  [pdf, other

    physics.soc-ph cs.SI physics.data-an stat.ME stat.ML

    Descriptive vs. inferential community detection in networks: pitfalls, myths, and half-truths

    Authors: Tiago P. Peixoto

    Abstract: Community detection is one of the most important methodological fields of network science, and one which has attracted a significant amount of attention over the past decades. This area deals with the automated division of a network into fundamental building blocks, with the objective of providing a summary of its large-scale structure. Despite its importance and widespread adoption, there is a no… ▽ More

    Submitted 6 July, 2023; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: 57 pages, 18 figures

    Journal ref: Elements in the Structure and Dynamics of Complex Networks, Cambridge University Press (2023)

  8. arXiv:2106.15821  [pdf, other

    cs.SI physics.soc-ph stat.ML

    Multilayer Networks for Text Analysis with Multiple Data Types

    Authors: Charles C. Hyland, Yuanming Tao, Lamiae Azizi, Martin Gerlach, Tiago P. Peixoto, Eduardo G. Altmann

    Abstract: We are interested in the widespread problem of clustering documents and finding topics in large collections of written documents in the presence of metadata and hyperlinks. To tackle the challenge of accounting for these different types of datasets, we propose a novel framework based on Multilayer Networks and Stochastic Block Models. The main innovation of our approach over other techniques is th… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: 17 pages, 6 figures

    Journal ref: EPJ Data Science volume 10, Article number: 33 (2021)

  9. arXiv:2101.02510  [pdf, other

    cs.SI physics.data-an physics.soc-ph stat.ML

    Disentangling homophily, community structure and triadic closure in networks

    Authors: Tiago P. Peixoto

    Abstract: Network homophily, the tendency of similar nodes to be connected, and transitivity, the tendency of two nodes being connected if they share a common neighbor, are conflated properties in network analysis, since one mechanism can drive the other. Here we present a generative model and corresponding inference procedure that are capable of distinguishing between both mechanisms. Our approach is based… ▽ More

    Submitted 6 January, 2022; v1 submitted 7 January, 2021; originally announced January 2021.

    Comments: 23 pages, 10 figures

    Journal ref: Phys. Rev. X 12, 011004 (2022)

  10. arXiv:2008.04948  [pdf, other

    cs.SI physics.soc-ph stat.AP stat.ML

    Hypergraph reconstruction from network data

    Authors: Jean-Gabriel Young, Giovanni Petri, Tiago P. Peixoto

    Abstract: Networks can describe the structure of a wide variety of complex systems by specifying which pairs of entities in the system are connected. While such pairwise representations are flexible, they are not necessarily appropriate when the fundamental interactions involve more than two entities at the same time. Pairwise representations nonetheless remain ubiquitous, because higher-order interactions… ▽ More

    Submitted 13 January, 2022; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: 13 pages, 7 figures. Code is available at https://graph-tool.skewed.de/

    Journal ref: Communication Physics 4, 135 (2021)

  11. arXiv:2006.14493  [pdf, other

    physics.soc-ph cond-mat.dis-nn cs.SI stat.ML

    Statistical inference of assortative community structures

    Authors: Lizhi Zhang, Tiago P. Peixoto

    Abstract: We develop a principled methodology to infer assortative communities in networks based on a nonparametric Bayesian formulation of the planted partition model. We show that this approach succeeds in finding statistically significant assortative modules in networks, unlike alternatives such as modularity maximization, which systematically overfits both in artificial as well as in empirical examples.… ▽ More

    Submitted 29 June, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 15 pages, 6 figures. Code is available at https://graph-tool.skewed.de and a HOWTO documentation at https://graph-tool.skewed.de/static/doc/demos/inference/inference.html

    Journal ref: Phys. Rev. Research 2, 043271 (2020)

  12. arXiv:2005.13977  [pdf, other

    physics.soc-ph cs.LG cs.SI stat.ML

    Revealing consensus and dissensus between network partitions

    Authors: Tiago P. Peixoto

    Abstract: Community detection methods attempt to divide a network into groups of nodes that share similar properties, thus revealing its large-scale structure. A major challenge when employing such methods is that they are often degenerate, typically yielding a complex landscape of competing answers. As an attempt to extract understanding from a population of alternative solutions, many methods exist to est… ▽ More

    Submitted 21 April, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 28 pages, 16 figures

    Journal ref: Phys. Rev. X 11, 021003 (2021)

  13. arXiv:2003.07070  [pdf, other

    physics.soc-ph cs.LG cs.SI physics.data-an stat.ML

    Merge-split Markov chain Monte Carlo for community detection

    Authors: Tiago P. Peixoto

    Abstract: We present a Markov chain Monte Carlo scheme based on merges and splits of groups that is capable of efficiently sampling from the posterior distribution of network partitions, defined according to the stochastic block model (SBM). We demonstrate how schemes based on the move of single nodes between groups systematically fail at correctly sampling from the posterior distribution even on small netw… ▽ More

    Submitted 13 July, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: 13 pages, 6 figures. Code available at https://graph-tool.skewed.de/static/doc/demos/inference/inference.html

    Journal ref: Phys. Rev. E 102, 012305 (2020)

  14. arXiv:2002.07803  [pdf, other

    physics.soc-ph cs.LG cs.SI physics.data-an stat.ML

    Latent Poisson models for networks with heterogeneous density

    Authors: Tiago P. Peixoto

    Abstract: Empirical networks are often globally sparse, with a small average number of connections per node, when compared to the total size of the network. However, this sparsity tends not to be homogeneous, and networks can also be locally dense, for example with a few nodes connecting to a large fraction of the rest of the network, or with small groups of nodes with a large probability of connections bet… ▽ More

    Submitted 17 July, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 19 pages, 16 figures

    Journal ref: Phys. Rev. E 102, 012309 (2020)

  15. arXiv:1903.10833  [pdf, other

    physics.soc-ph cs.SI physics.data-an stat.ML

    Network reconstruction and community detection from dynamics

    Authors: Tiago P. Peixoto

    Abstract: We present a scalable nonparametric Bayesian method to perform network reconstruction from observed functional behavior that at the same time infers the communities present in the network. We show that the joint reconstruction with community detection has a synergistic effect, where the edge correlations used to inform the existence of communities are also inherently used to improve the accuracy o… ▽ More

    Submitted 20 September, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: 11 pages, 6 figures, 2 tables

    Journal ref: Phys. Rev. Lett. 123, 128301 (2019)

  16. arXiv:1806.07956  [pdf, other

    cs.SI cs.LG physics.data-an stat.ML

    Reconstructing networks with unknown and heterogeneous errors

    Authors: Tiago P. Peixoto

    Abstract: The vast majority of network datasets contains errors and omissions, although this is rarely incorporated in traditional network analysis. Recently, an increasing effort has been made to fill this methodological gap by developing network reconstruction approaches based on Bayesian inference. These approaches, however, rely on assumptions of uniform error rates and on direct estimations of the exis… ▽ More

    Submitted 18 October, 2018; v1 submitted 9 June, 2018; originally announced June 2018.

    Comments: 27 pages, 17 figures

    Journal ref: Phys. Rev. X 8, 041011 (2018)

  17. arXiv:1708.01677  [pdf, other

    stat.ML cs.CL physics.data-an physics.soc-ph

    A network approach to topic models

    Authors: Martin Gerlach, Tiago P. Peixoto, Eduardo G. Altmann

    Abstract: One of the main computational and scientific challenges in the modern age is to extract useful information from unstructured texts. Topic models are one popular machine-learning approach which infers the latent topical structure of a collection of documents. Despite their success --- in particular of its most widely used variant called Latent Dirichlet Allocation (LDA) --- and numerous application… ▽ More

    Submitted 19 July, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

    Comments: 22 pages, 10 figures, code available at https://topsbm.github.io/

    Journal ref: Science Advances 4, eaaq1360 (2018)

  18. arXiv:1708.01432  [pdf, other

    stat.ML physics.data-an physics.soc-ph

    Nonparametric weighted stochastic block models

    Authors: Tiago P. Peixoto

    Abstract: We present a Bayesian formulation of weighted stochastic block models that can be used to infer the large-scale modular structure of weighted networks, including their hierarchical organization. Our method is nonparametric, and thus does not require the prior knowledge of the number of groups or other dimensions of the model, which are instead inferred from data. We give a comprehensive treatment… ▽ More

    Submitted 18 January, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

    Comments: 19 pages, 11 figures. Code is freely available as part of graph-tool at https://graph-tool.skewed.de . See also the HOWTO at https://graph-tool.skewed.de/static/doc/demos/inference/inference.html

    Journal ref: Phys. Rev. E 97, 012306 (2018)

  19. arXiv:1705.10225  [pdf, other

    stat.ML cond-mat.stat-mech physics.data-an

    Bayesian stochastic blockmodeling

    Authors: Tiago P. Peixoto

    Abstract: This chapter provides a self-contained introduction to the use of Bayesian inference to extract large-scale modular structures from network data, based on the stochastic blockmodel (SBM), as well as its degree-corrected and overlapping generalizations. We focus on nonparametric formulations that allow their inference in a manner that prevents overfitting, and enables model selection. We discuss as… ▽ More

    Submitted 22 March, 2023; v1 submitted 29 May, 2017; originally announced May 2017.

    Comments: 44 pages, 16 figures. Minor typos fixed. Code is freely available as part of graph-tool at https://graph-tool.skewed.de . See also the HOWTO at https://graph-tool.skewed.de/static/doc/demos/inference/inference.html

    Journal ref: "Advances in Network Clustering and Blockmodeling", edited by P. Doreian, V. Batagelj, A. Ferligoj, (Wiley, New York, 2019)

  20. arXiv:1705.07967  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech

    Consistencies and inconsistencies between model selection and link prediction in networks

    Authors: Toni Vallès-Català, Tiago P. Peixoto, Roger Guimerà, Marta Sales-Pardo

    Abstract: A principled approach to understand network structures is to formulate generative models. Given a collection of models, however, an outstanding key task is to determine which one provides a more accurate description of the network at hand, discounting statistical fluctuations. This problem can be approached using two principled criteria that at first may seem equivalent: selecting the most plausib… ▽ More

    Submitted 28 June, 2018; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: 12 pages, 6 figures, 1 table

    Journal ref: Phys. Rev. E 97, 062316 (2018)

  21. arXiv:1610.02703  [pdf, other

    physics.data-an physics.soc-ph stat.ML

    Nonparametric Bayesian inference of the microcanonical stochastic block model

    Authors: Tiago P. Peixoto

    Abstract: A principled approach to characterize the hidden structure of networks is to formulate generative models, and then infer their parameters from data. When the desired structure is composed of modules or "communities", a suitable choice for this task is the stochastic block model (SBM), where nodes are divided into groups, and the placement of edges is conditioned on the group memberships. Here, we… ▽ More

    Submitted 22 August, 2018; v1 submitted 9 October, 2016; originally announced October 2016.

    Comments: 24 pages, 9 figures, 1 table. Code is freely available as part of graph-tool at https://graph-tool.skewed.de . See also the HOWTO at https://graph-tool.skewed.de/static/doc/demos/inference/inference.html . Minor typos fixed in most recent version

    Journal ref: Phys. Rev. E 95, 012317 (2017)

  22. arXiv:1604.00255  [pdf, other

    physics.soc-ph cs.SI stat.ML

    Network structure, metadata and the prediction of missing nodes and annotations

    Authors: Darko Hric, Tiago P. Peixoto, Santo Fortunato

    Abstract: The empirical validation of community detection methods is often based on available annotations on the nodes that serve as putative indicators of the large-scale network structure. Most often, the suitability of the annotations as topological descriptors itself is not assessed, and without this it is not possible to ultimately distinguish between actual shortcomings of the community detection algo… ▽ More

    Submitted 29 September, 2016; v1 submitted 1 April, 2016; originally announced April 2016.

    Comments: 15 pages, 6 figures, 1 table

    Journal ref: Phys. Rev. X 6, 031038 (2016)

  23. arXiv:1509.04740  [pdf, other

    cs.SI cond-mat.stat-mech physics.soc-ph stat.ML

    Modeling sequences and temporal networks with dynamic community structures

    Authors: Tiago P. Peixoto, Martin Rosvall

    Abstract: In evolving complex systems such as air traffic and social organizations, collective effects emerge from their many components' dynamic interactions. While the dynamic interactions can be represented by temporal networks with nodes and links that change over time, they remain highly complex. It is therefore often necessary to use methods that extract the temporal networks' large-scale dynamic comm… ▽ More

    Submitted 20 September, 2017; v1 submitted 15 September, 2015; originally announced September 2015.

    Comments: 15 Pages, 6 figures, 2 tables

    Journal ref: Nature Communications 8, 582 (2017)

  24. arXiv:1310.4378  [pdf, other

    physics.data-an cond-mat.stat-mech cs.SI physics.comp-ph stat.ML

    Efficient Monte Carlo and greedy heuristic for the inference of stochastic block models

    Authors: Tiago P. Peixoto

    Abstract: We present an efficient algorithm for the inference of stochastic block models in large networks. The algorithm can be used as an optimized Markov chain Monte Carlo (MCMC) method, with a fast mixing time and a much reduced susceptibility to getting trapped in metastable states, or as a greedy agglomerative heuristic, with an almost linear $O(N\ln^2N)$ complexity, where $N$ is the number of nodes i… ▽ More

    Submitted 13 January, 2014; v1 submitted 16 October, 2013; originally announced October 2013.

    Comments: 9 pages, 9 figures

    Journal ref: Phys. Rev. E 89, 012804 (2014)

  25. arXiv:1310.4377  [pdf, other

    physics.data-an cond-mat.dis-nn cond-mat.stat-mech cs.SI physics.soc-ph stat.ML

    Hierarchical Block Structures and High-resolution Model Selection in Large Networks

    Authors: Tiago P. Peixoto

    Abstract: Discovering and characterizing the large-scale topological features in empirical networks are crucial steps in understanding how complex systems function. However, most existing methods used to obtain the modular structure of networks suffer from serious problems, such as being oblivious to the statistical evidence supporting the discovered patterns, which results in the inability to separate actu… ▽ More

    Submitted 25 March, 2014; v1 submitted 16 October, 2013; originally announced October 2013.

    Comments: 18 pages, 9 figures + Supplemental Material

    Journal ref: Phys. Rev. X 4, 011047 (2014)

  26. arXiv:1212.4794  [pdf, other

    physics.data-an physics.soc-ph stat.ML

    Parsimonious module inference in large networks

    Authors: Tiago P. Peixoto

    Abstract: We investigate the detectability of modules in large networks when the number of modules is not known in advance. We employ the minimum description length (MDL) principle which seeks to minimize the total amount of information required to describe the network, and avoid overfitting. According to this criterion, we obtain general bounds on the detectability of any prescribed block structure, given… ▽ More

    Submitted 8 April, 2013; v1 submitted 19 December, 2012; originally announced December 2012.

    Comments: 5 pages, 4 figures + Supplemental Material

    Journal ref: Phys. Rev. Lett. 110, 148701 (2013)