-
LDP for the covariance process in fully connected neural networks
Authors:
Luisa Andreis,
Federico Bassetti,
Christian Hirsch
Abstract:
In this work, we study large deviation properties of the covariance process in fully connected Gaussian deep neural networks. More precisely, we establish a large deviation principle (LDP) for the covariance process in a functional framework, viewing it as a process in the space of continuous functions. As key applications of our main results, we obtain posterior LDPs under Gaussian likelihood in both the infinite-width and mean-field regimes. The proof is based on an LDP for the covariance process as a Markov process valued in the space of non-negative, symmetric trace-class operators equipped with the trace norm.
Submitted 12 May, 2025;
originally announced May 2025.
-
Proportional infinite-width infinite-depth limit for deep linear neural networks
Authors:
Federico Bassetti,
Lucia Ladelli,
Pietro Rotondo
Abstract:
We study the distributional properties of linear neural networks with random parameters in the context of large networks, where the number of layers diverges in proportion to the number of neurons per layer. Prior works have shown that in the infinite-width regime, where the number of neurons per layer grows to infinity while the depth remains fixed, neural networks converge to a Gaussian process, known as the Neural Network Gaussian Process. However, this Gaussian limit sacrifices descriptive power, as it lacks the ability to learn dependent features and produce output correlations that reflect observed labels. Motivated by these limitations, we explore the joint proportional limit in which both depth and width diverge but maintain a constant ratio, yielding a non-Gaussian distribution that retains correlations between outputs. Our contribution extends previous works by rigorously characterizing, for linear activation functions, the limiting distribution as a nontrivial mixture of Gaussians.
Submitted 22 November, 2024;
originally announced November 2024.
-
Feature learning in finite-width Bayesian deep linear networks with multiple outputs and convolutional layers
Authors:
Federico Bassetti,
Marco Gherardi,
Alessandro Ingrosso,
Mauro Pastore,
Pietro Rotondo
Abstract:
Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript, we provide rigorous results for the statistics of functions implemented by the aforementioned class of networks, thus moving closer to a complete characterization of feature learning in the Bayesian setting. Our results include: (i) an exact and elementary non-asymptotic integral representation for the joint prior distribution over the outputs, given in terms of a mixture of Gaussians; (ii) an analytical formula for the posterior distribution in the case of squared error loss function (Gaussian likelihood); (iii) a quantitative description of the feature learning infinite-width regime, using large deviation theory. From a physical perspective, deep architectures with multiple outputs or convolutional layers represent different manifestations of kernel shape renormalization, and our work provides a dictionary that translates this physics intuition and terminology into rigorous Bayesian statistics.
Submitted 16 June, 2025; v1 submitted 5 June, 2024;
originally announced June 2024.
-
Clustering structure for species sampling sequences with general base measure
Authors:
Federico Bassetti,
Lucia Ladelli
Abstract:
We investigate the clustering structure of species sampling sequences $(ξ_n)_n$, with general base measure. Such sequences are exchangeable with a species sampling random probability as directing measure. The clustering properties of these sequences are interesting for Bayesian nonparametrics applications, where mixed base measures are used, for example, to accommodate sharp hypotheses in regression problems and provide sparsity. In this paper, we prove a stochastic representation for $(ξ_n)_n$ in terms of a latent exchangeable random partition. We provide an explicit expression for the EPPF of the partition generated by $(ξ_n)_n$ in terms of the EPPF of the latent partition. We investigate the asymptotic behaviour of the total number of blocks and of the number of blocks with fixed cardinality in the partition generated by $(ξ_n)_n$.
Submitted 28 August, 2019;
originally announced August 2019.
-
Computing Kantorovich-Wasserstein Distances on $d$-dimensional histograms using $(d+1)$-partite graphs
Authors:
Gennaro Auricchio,
Federico Bassetti,
Stefano Gualandi,
Marco Veneroni
Abstract:
This paper presents a novel method to compute the exact Kantorovich-Wasserstein distance between a pair of $d$-dimensional histograms having $n$ bins each. We prove that this problem is equivalent to an uncapacitated minimum cost flow problem on a $(d+1)$-partite graph with $(d+1)n$ nodes and $dn^{\frac{d+1}{d}}$ arcs, whenever the cost is separable along the principal $d$-dimensional directions. We show numerically the benefits of our approach by computing the Kantorovich-Wasserstein distance of order 2 among two sets of instances: gray scale images and $d$-dimensional biomedical histograms. On these types of instances, our approach is competitive with state-of-the-art optimal transport algorithms.
Submitted 11 January, 2019; v1 submitted 18 May, 2018;
originally announced May 2018.
-
On the Computation of Kantorovich-Wasserstein Distances between 2D-Histograms by Uncapacitated Minimum Cost Flows
Authors:
Federico Bassetti,
Stefano Gualandi,
Marco Veneroni
Abstract:
In this work, we present a method to compute the Kantorovich-Wasserstein distance of order one between a pair of two-dimensional histograms. Recent works in Computer Vision and Machine Learning have shown the benefits of measuring Wasserstein distances of order one between histograms with $n$ bins, by solving a classical transportation problem on very large complete bipartite graphs with $n$ nodes and $n^2$ edges. The main contribution of our work is to approximate the original transportation problem by an uncapacitated min cost flow problem on a reduced flow network of size $O(n)$ that exploits the geometric structure of the cost function. More precisely, when the distance among the bin centers is measured with the 1-norm or the $\infty$-norm, our approach provides an optimal solution. When the distance among bins is measured with the 2-norm: (i) we derive a quantitative estimate on the error between optimal and approximate solution; (ii) given the error, we construct a reduced flow network of size $O(n)$. We numerically show the benefits of our approach by computing Wasserstein distances of order one on a set of grey scale images used as benchmark in the literature. We show how our approach scales with the size of the images with 1-norm, 2-norm and $\infty$-norm ground distances, and we compare it with other two methods which are largely used in the literature.
Submitted 26 July, 2019; v1 submitted 2 April, 2018;
originally announced April 2018.
-
Hierarchical Species Sampling Models
Authors:
Federico Bassetti,
Roberto Casarin,
Luca Rossini
Abstract:
This paper introduces a general class of hierarchical nonparametric prior distributions. The random probability measures are constructed by a hierarchy of generalized species sampling processes with possibly non-diffuse base measures. The proposed framework provides a general probabilistic foundation for hierarchical random measures with either atomic or mixed base measures and allows for studying their properties, such as the distribution of the marginal and total number of clusters. We show that hierarchical species sampling models have a Chinese Restaurants Franchise representation and can be used as prior distributions to undertake Bayesian nonparametric inference. We provide a method to sample from the posterior distribution together with some numerical illustrations. Our class of priors includes some new hierarchical mixture priors such as the hierarchical Gnedin measures, and other well-known prior distributions such as the hierarchical Pitman-Yor and the hierarchical normalized random measures.
Submitted 15 March, 2018;
originally announced March 2018.
-
Mean field dynamics of collisional processes with duplication, loss and copy
Authors:
Federico Bassetti,
Giuseppe Toscani
Abstract:
In this paper we introduce and discuss kinetic equations for the evolution of the probability distribution of the number of particles in a population subject to binary interactions. The microscopic binary law of interaction is assumed to be dependent on fixed-in-time random parameters which describe both birth and death of particles, and the migration rule. These assumptions lead to a Boltzmann-type equation that in the case in which the mean number of the population is preserved, can be fully studied, by obtaining in some case the analytic description of the steady profile. In all cases, however, a simpler kinetic description can be derived, by considering the limit of quasi-invariant interactions. This procedure allows to describe the evolution process in terms of a linear kinetic transport-type equation. Among the various processes that can be described in this way, one recognizes the Lea-Coulson model of mutation processes in bacteria, a variation of the original model proposed by Luria and Delbrück.
Submitted 12 January, 2015;
originally announced January 2015.
-
Infinite energy solutions to inelastic homogeneous Boltzmann equation
Authors:
Federico Bassetti,
Lucia Ladelli,
Daniel Matthes
Abstract:
This paper is concerned with the existence, shape and dynamical stability of infinite-energy equilibria for a general class of spatially homogeneous kinetic equations in space dimensions $d \geq 3$. Our results cover in particular Bobylëv's model for inelastic Maxwell molecules. First, we show under certain conditions on the collision kernel, that there exists an index $α\in(0,2)$ such that the equation possesses a nontrivial stationary solution, which is a scale mixture of radially symmetric $α$-stable laws. We also characterize the mixing distribution as the fixed point of a smoothing transformation. Second, we prove that any transient solution that emerges from the NDA of some (not necessarily radial symmetric) $α$-stable distribution converges to an equilibrium. The key element of the convergence proof is an application of the central limit theorem to a representation of the transient solution as a weighted sum of i.i.d. random vectors.
Submitted 27 September, 2013;
originally announced September 2013.
-
Large Deviations for the solution of a Kac-type kinetic equation
Authors:
Federico Bassetti,
Lucia Ladelli
Abstract:
The aim of this paper is to study large deviations for the self-similar solution of a Kac-type kinetic equation. Under the assumption that the initial condition belongs to the domain of normal attraction of a stable law of index $α<2$ and under suitable assumptions on the collisional kernel, precise asymptotic behavior of the large deviations probability is given.
Submitted 16 October, 2012;
originally announced October 2012.
-
Speed of convergence to equilibrium in Wasserstein metrics for Kac-s like kinetic equations
Authors:
Federico Bassetti,
Eleonora Perversi
Abstract:
This work deals with a class of one-dimensional measure-valued kinetic equations, which constitute extensions of the Kac caricature. It is known that if the initial datum belongs to the domain of normal attraction of an α-stable law, the solution of the equation converges weakly to a suitable scale mixture of centered α-stable laws. In this paper we present explicit exponential rates for the convergence to equilibrium in Kantorovich-Wasserstein distances of order p>α, under the natural assumption that the distance between the initial datum and the limit distribution is finite. For α=2 this assumption reduces to the finiteness of the absolute moment of order p of the initial datum. On the contrary, when α<2, the situation is more problematic due to the fact that both the limit distribution and the initial datum have infinite absolute moment of any order p >α. For this case, we provide sufficient conditions for the finiteness of the Kantorovich-Wasserstein distance.
Submitted 22 October, 2012; v1 submitted 16 May, 2012;
originally announced May 2012.
-
Beta-Product Poisson-Dirichlet Processes
Authors:
Federico Bassetti,
Roberto Casarin,
Fabrizio Leisen
Abstract:
Time series data may exhibit clustering over time and, in a multiple time series context, the clustering behavior may differ across the series. This paper is motivated by the Bayesian non-parametric modeling of the dependence between the clustering structures and the distributions of different time series. We follow a Dirichlet process mixture approach and introduce a new class of multivariate dependent Dirichlet processes (DDP). The proposed DDP are represented in terms of a vector of stick-breaking processes with dependent weights. The weights are beta random vectors that determine different and dependent clustering effects along the dimension of the DDP vector. We discuss some theoretical properties and provide an efficient Monte Carlo Markov Chain algorithm for posterior computation. The effectiveness of the method is illustrated with a simulation study and an application to the United States and the European Union industrial production indexes.
Submitted 22 September, 2011;
originally announced September 2011.
-
Homogeneous kinetic equations for probabilistic linear collisions in multiple space dimensions
Authors:
Federico Bassetti,
Daniel Matthes
Abstract:
We analyze the convergence to equilibrium in a family of Kac-like kinetic equations in multiple space dimensions. These equations describe the change of the velocity distribution in a spatially homogeneous gas due to binary collisions between the particles. We consider a general linear mechanism for the exchange of the particles' momenta, with interaction coefficients that are random matrices with a distribution that is independent of the velocities of the colliding particles. Applying a synthesis of probabilistic methods and Fourier analysis, we are able to identify sufficient conditions for the existence and uniqueness of a stationary state, we characterize this stationary state as a mixture of Gaussian distributions, and we prove equilibration of transient solutions under minimal hypotheses on the initial conditions. In particular, we are able to classify the high-energy tails of the stationary distribution, which might be of Pareto type. We also discuss several examples to which our theory applies, among them models with a non-symmetric stationary state.
Submitted 12 May, 2011;
originally announced May 2011.
-
Generalized Species Sampling Priors with Latent Beta reinforcements
Authors:
Edoardo M. Airoldi,
Thiago Costa,
Federico Bassetti,
Fabrizio Leisen,
Michele Guindani
Abstract:
Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a novel and probabilistically coherent family of non-exchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of independent Beta random variables. We compare their theoretical clustering properties with those of the Dirichlet Process and the two parameters Poisson-Dirichlet process. The proposed construction provides a complete characterization of the joint process, differently from existing work. We then propose the use of such process as prior distribution in a hierarchical Bayes modeling framework, and we describe a Markov Chain Monte Carlo sampler for posterior inference. We evaluate the performance of the prior and the robustness of the resulting inference in a simulation study, providing a comparison with popular Dirichlet Processes mixtures and Hidden Markov Models. Finally, we develop an application to the detection of chromosomal aberrations in breast cancer by leveraging array CGH data.
Submitted 1 August, 2014; v1 submitted 3 December, 2010;
originally announced December 2010.
-
Self-similar solutions in one-dimensional kinetic models: A probabilistic view
Authors:
Federico Bassetti,
Lucia Ladelli
Abstract:
This paper deals with a class of Boltzmann equations on the real line, extensions of the well-known Kac caricature. A distinguishing feature of the corresponding equations is that therein, the collision gain operators are defined by N-linear smoothing transformations. These kinds of problems have been studied, from an essentially analytic viewpoint, in a recent paper by Bobylev, Cercignani and Gamba [Comm. Math. Phys. 291 (2009) 599-644]. Instead, the present work rests exclusively on probabilistic methods, based on techniques pertaining to the classical central limit problem and to the so-called fixed-point equations for probability distributions. An advantage of resorting to methods from probability theory is that the same results - relative to self-similar solutions - as those obtained by Bobylev, Cercignani and Gamba, are here deduced under weaker conditions. In particular, it is shown how convergence to a self-similar solution depends on whether the initial datum belongs to the domain of attraction of a specific stable distribution. Moreover, some results on the speed of convergence are given in terms of Kantorovich-Wasserstein and Zolotarev distances between probability measures.
Submitted 19 October, 2012; v1 submitted 29 March, 2010;
originally announced March 2010.
-
Central limit theorem for a class of one-dimensional kinetic equations
Authors:
Federico Bassetti,
Lucia Ladelli,
Daniel Matthes
Abstract:
We introduce a class of Boltzmann equations on the real line, which constitute extensions of the classical Kac caricature. The collisional gain operators are defined by smoothing transformations with quite general properties. By establishing a connection to the central limit problem, we are able to prove long-time convergence of the equation's solutions towards a limit distribution. If the initial condition for the Boltzmann equation belongs to the domain of normal attraction of a certain stable law $g_α$, then the limit is non-trivial and is a statistical mixture of dilations of $g_α$. Under some additional assumptions, explicit exponential rates for the equilibration in Wasserstein metrics are calculated, and strong convergence of the probability densities is shown.
Submitted 16 October, 2008; v1 submitted 9 September, 2008;
originally announced September 2008.
-
Quantitative comparisons between finitary posterior distributions and Bayesian posterior distributions
Authors:
Federico Bassetti
Abstract:
The main object of Bayesian statistical inference is the determination of posterior distributions. Sometimes these laws are given for quantities devoid of empirical value. This serious drawback vanishes when one confines oneself to considering a finite horizon framework. However, assuming infinite exchangeability gives rise to fairly tractable a posteriori quantities, which is very attractive in applications. Hence, with a view to a reconciliation between these two aspects of the Bayesian way of reasoning, in this paper we provide quantitative comparisons between posterior distributions of finitary parameters and posterior distributions of allied parameters appearing in usual statistical models.
Submitted 8 July, 2008;
originally announced July 2008.
-
Conditionally identically distributed species sampling sequences
Authors:
Federico Bassetti,
Irene Crimaldi,
Fabrizio Leisen
Abstract:
Conditional identity in distribution (Berti et al. (2004)) is a new type of dependence for random variables, which generalizes the well-known notion of exchangeability. In this paper, a class of random sequences, called Generalized Species Sampling Sequences, is defined and a condition to have conditional identity in distribution is given. Moreover, a class of generalized species sampling sequences that are conditionally identically distributed is introduced and studied: the Generalized Ottawa sequences (GOS). This class contains a "randomly reinforced" version of the Pólya urn and of the Blackwell-MacQueen urn scheme. For the empirical means and the predictive means of a GOS, we prove two convergence results toward suitable mixtures of Gaussian distributions. The first one is in the sense of stable convergence and the second one in the sense of almost sure conditional convergence. In the last part of the paper we study the length of the partition induced by a GOS at time $n$, i.e. the random number of distinct values of a GOS until time $n$. Under suitable conditions, we prove a strong law of large numbers and a central limit theorem in the sense of stable convergence. All the given results in the paper are accompanied by some examples.
Submitted 17 June, 2008;
originally announced June 2008.
-
Probabilistic study of the speed of approach to equilibrium for an inelastic Kac model
Authors:
Federico Bassetti,
Lucia Ladelli,
Eugenio Regazzini
Abstract:
This paper deals with a one-dimensional model for granular materials, which boils down to an inelastic version of the Kac kinetic equation, with inelasticity parameter $p>0$. In particular, the paper provides bounds for certain distances -- such as specific weighted $χ$-distances and the Kolmogorov distance -- between the solution of that equation and the limit. It is assumed that the even part of the initial datum (which determines the asymptotic properties of the solution) belongs to the domain of normal attraction of a symmetric stable distribution with characteristic exponent $α=2/(1+p)$. With such initial data, it turns out that the limit exists and is just the aforementioned stable distribution. A necessary condition for the relaxation to equilibrium is also proved. Some bounds are obtained without introducing any extra condition. Sharper bounds, of an exponential type, are exhibited in the presence of additional assumptions concerning either the behaviour, near to the origin, of the initial characteristic function, or the behaviour, at infinity, of the initial probability distribution function.
Submitted 22 May, 2008;
originally announced May 2008.
-
Exchangeable Random Networks
Authors:
F. Bassetti,
M. Cosentino Lagomarsino,
S. Mandrá
Abstract:
We introduce and study a class of exchangeable random graph ensembles. They can be used as statistical null models for empirical networks, and as a tool for theoretical investigations. We provide general theorems that characterize the degree distribution of the ensemble graphs, together with some features that are important for applications, such as subgraph distributions and kernel of the adjacency matrix. These results are used to compare to other models of simple and complex networks. A particular case of directed networks with power-law out-degree is studied in more detail, as an example of the flexibility of the model in applications.
Submitted 12 August, 2008; v1 submitted 24 July, 2007;
originally announced July 2007.