Estimating network properties from snowball sampled data
2012 (English)In: Social Networks, ISSN 0378-8733, Vol. 34, no 4, 701-711 p.Article in journal (Refereed) Published
This article addresses the estimation of topological network parameters from data obtained with a snowball sampling design. An approximate expression for the probability of a vertex to be included in the sample is derived. Based on this sampling distribution, estimators for the mean degree, the degree correlation, and the clustering coefficient are proposed. The performance of these estimators and their sensitivity with respect to the response rate are validated through Monte Carlo simulations on several test networks. Our approach has no complex computational requirements and is straightforward to apply to real-world survey data. In a snowball sample design, each respondent is typically enquired only once. Different from the widely used estimator for Respondent-Driven Sampling (RDS), which assumes sampling with replacement, the proposed approach relies on sampling without replacement and is thus also applicable for large sample fractions. From the simulation experiments, we conclude that the estimation quality decreases with increasing variance of the network degree distribution. Yet, if the degree distribution is not to broad, our approach results in good estimates for the mean degree and the clustering coefficient, which, moreover, are almost independent from the response rate. The estimates for the degree correlation are of moderated quality.
Place, publisher, year, edition, pages
2012. Vol. 34, no 4, 701-711 p.
Monte Carlo simulation, Snowball sampling, Statistical inference
Transport Systems and Logistics
IdentifiersURN: urn:nbn:se:kth:diva-116858DOI: 10.1016/j.socnet.2012.09.001ISI: 000313304100032ScopusID: 2-s2.0-84872387933OAI: oai:DiVA.org:kth-116858DiVA: diva2:601124
FunderTrenOp, Transport Research Environment with Novel Perspectives
QC 201301282013-01-282013-01-282013-04-10Bibliographically approved