-
Respondent Driven Sampling on sparse Erdös-Rényi graphs
Authors:
Anthony Cousien,
Jean-Stéphane Dhersin,
Viet Chi Tran,
Thi Phuong Thuy Vo
Abstract:
We study the exploration of an Erdös-Rényi random graph by a respondent-driven sampling method, where discovered vertices reveal their neighbours. Some of them receive coupons to reveal, in turn, their own neighbourhoods. This leads to the study of a Markov chain on the random graph. For large sparse Erdös-Rényi graphs, this process, suitably renormalized, converges to a deterministic curve, the solution of a system of ODEs absorbed on the abscissa axis. The associated fluctuation process is also studied, yielding a functional central limit theorem with a Gaussian limiting process. Simulations and numerical computations illustrate the study.
Submitted 30 March, 2021;
originally announced March 2021.
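The exploration described in this abstract can be illustrated with a small simulation. The following is a minimal sketch and not the authors' code: the function names, the rule that each interviewee hands out at most `coupons` coupons to known neighbours chosen uniformly at random, and the uniform choice of the next interviewee are all assumptions made for illustration.

```python
import random

def erdos_renyi(n, lam, rng):
    """Adjacency sets of a sparse graph G(n, lam/n): each pair is linked independently."""
    p = lam / n
    adj = [set() for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                adj[i].add(j)
                adj[j].add(i)
    return adj

def rds_explore(adj, coupons, seed_vertex, rng):
    """Run the exploration and return the pairs (A_k, B_k) after each interview.

    A_k: coupon holders not yet interviewed.
    B_k: vertices already discovered but holding no coupon.
    (Notation chosen here for illustration.)
    """
    UNKNOWN, KNOWN, COUPON, DONE = 0, 1, 2, 3
    state = [UNKNOWN] * len(adj)
    state[seed_vertex] = COUPON
    waiting = [seed_vertex]      # coupon holders waiting to be interviewed
    known_count = 0              # discovered vertices without a coupon
    path = []
    while waiting:
        # Assumption: the next interviewee is a uniformly chosen coupon holder.
        v = waiting.pop(rng.randrange(len(waiting)))
        state[v] = DONE
        # The interviewee reveals neighbours that were not discovered before.
        for w in adj[v]:
            if state[w] == UNKNOWN:
                state[w] = KNOWN
                known_count += 1
        # Assumption: coupons go to known neighbours that hold none yet.
        eligible = [w for w in adj[v] if state[w] == KNOWN]
        for w in rng.sample(eligible, min(coupons, len(eligible))):
            state[w] = COUPON
            known_count -= 1
            waiting.append(w)
        path.append((len(waiting), known_count))
    return path

if __name__ == "__main__":
    rng = random.Random(1)
    graph = erdos_renyi(500, 2.0, rng)
    trajectory = rds_explore(graph, coupons=2, seed_vertex=0, rng=rng)
    print(len(trajectory), trajectory[:5])
```

Dividing both the step index and the pair (A_k, B_k) by the number of vertices gives the renormalized trajectory that the abstract compares with a deterministic curve; the exploration stops when no coupon holder is left, which corresponds to absorption on the abscissa axis.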
-
Estimation of dense stochastic block models visited by random walks
Authors:
Viet Chi Tran,
Thi Phuong Thuy Vo
Abstract:
We are interested in recovering information on a stochastic block model from the subgraph discovered by an exploring random walk. Stochastic block models correspond to populations structured into a finite number of types, where two individuals are connected by an edge independently of the other pairs and with a probability depending on their types. We consider here the dense case, where the random network can be approximated by a graphon. This problem is motivated by the study of chain-referral surveys, where each interviewee provides information on her/his contacts in the social network. First, we write the likelihood of the subgraph discovered by the random walk: biases appear, since hubs and majority types are more likely to be sampled. Even when the types are observed, the maximum likelihood estimator is no longer explicit. When the types of the vertices are unobserved, we use an SAEM algorithm to maximize the likelihood. Second, we propose a different estimation strategy using new results by Athreya and Roellin. It consists in de-biasing the maximum likelihood estimator proposed in Daudin et al., which ignores the biases.
Submitted 7 June, 2021; v1 submitted 14 June, 2020;
originally announced June 2020.
-
Chain-referral sampling on Stochastic Block Models
Authors:
Thi Phuong Thuy Vo
Abstract:
The discovery of a "hidden population", whose size and membership are unknown, is made possible by assuming that its members are connected in a social network by their relationships. We explore these groups by a chain-referral sampling (CRS) method, where participants recommend the people they know. This leads to the study of a Markov chain on a random graph where vertices represent individuals and edges describe the relationships between the corresponding people. We are interested in the CRS process on the stochastic block model (SBM), which extends the well-known Erdös-Rényi graphs to populations partitioned into communities. The SBM considered here is characterized by a number of vertices $N$, a number of communities (blocks) $m$, the community proportions $\pi=(\pi_1,\dots,\pi_m)$ and a connection pattern between blocks $P=(\lambda_{kl}/N)_{(k,l) \in \{1,\dots,m\}^2}$. In this paper, we give a precise description of the dynamics of the CRS process in discrete time on an SBM. The difficulty lies in handling the heterogeneity of the graph. We prove that when the population size is large, the normalized stochastic process of the referral chain behaves like a deterministic curve, which is the unique solution of a system of ODEs.
Submitted 20 May, 2020; v1 submitted 13 February, 2019;
originally announced February 2019.
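The SBM parametrization in this abstract (number of vertices $N$, proportions $\pi$, connection probabilities $\lambda_{kl}/N$) can be made concrete with a short sampler. This is a minimal sketch under stated assumptions, not the paper's construction: the function name `sample_sbm` is hypothetical, the matrix of $\lambda_{kl}$ is assumed symmetric, and vertex types are assumed to be drawn independently from $\pi$ (the paper may instead fix the block sizes).

```python
import random

def sample_sbm(n, pi, lam, rng):
    """Sample a sparse SBM with n vertices.

    pi  : list of community proportions (pi_1, ..., pi_m), summing to 1.
    lam : symmetric m x m matrix; vertices of types k and l are linked
          independently with probability lam[k][l] / n.
    Returns (types, adj) with adj a list of neighbour sets.
    """
    m = len(pi)
    # Assumption: types are i.i.d. draws from pi.
    types = rng.choices(range(m), weights=pi, k=n)
    adj = [set() for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < lam[types[i]][types[j]] / n:
                adj[i].add(j)
                adj[j].add(i)
    return types, adj

if __name__ == "__main__":
    rng = random.Random(42)
    pi = [0.7, 0.3]
    lam = [[3.0, 0.5],
           [0.5, 2.0]]
    types, adj = sample_sbm(400, pi, lam, rng)
    mean_degree = sum(len(nb) for nb in adj) / len(adj)
    print("share of type 0:", types.count(0) / len(types))
    print("mean degree:", mean_degree)
```

With a single block (`pi = [1.0]`, `lam = [[lam0]]`) the sampler reduces to the Erdös-Rényi graph with edge probability `lam0 / n`, which is the sense in which the SBM extends that model.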