Respondent-driven sampling bias induced by community structure and response rates in social networks

Rocha, Luis E. C. (ORCID: https://orcid.org/0000-0001-9046-8739), Thorson, Anna E., Lambiotte, Renaud and Liljeros, Fredrik (2016) Respondent-driven sampling bias induced by community structure and response rates in social networks. Journal of the Royal Statistical Society: Series A (Statistics in Society), 180 (1). pp. 99-118. ISSN 0964-1998 (Print), 1467-985X (Online) (doi:10.1111/rssa.12180)

PDF (Author Accepted Manuscript)
19633 ROCHA_Respondent-Driven_Sampling_Bias_2016.pdf - Accepted Version

Abstract

Sampling hidden populations with standard sampling methods is particularly challenging, mainly because no sampling frame is available. Respondent‐driven sampling is an alternative methodology that exploits the social contacts between peers to reach and weight individuals in these hard‐to‐reach populations. It is a snowball sampling procedure in which each respondent's weight is adjusted for the likelihood of being sampled, which varies with the respondent's number of contacts. The structure of the social contacts thus regulates the process by constraining the sampling within subregions of the network. We study the bias that network communities, which are groups of individuals more connected among themselves than with individuals in other groups, induce in the respondent‐driven sampling estimator. We simulate different structures and response rates to reproduce real settings. We find that the prevalence of the estimated variable is associated with the size of the network community to which the individual belongs, and we observe that low-degree nodes may be undersampled if the sample and the network are of similar size. We also find that respondent‐driven sampling estimators perform well if response rates are relatively large and the community structure is weak, whereas low response rates typically generate strong biases irrespective of the community structure.
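
The degree-based weighting described in the abstract can be illustrated with a minimal sketch. The abstract does not name a specific estimator, so the code below assumes the common inverse-degree weighting, in which each respondent counts in proportion to 1/degree. The function name, the binary trait coding, and the sample data are hypothetical and are not taken from the paper.

```python
def inverse_degree_prevalence(traits, degrees):
    """Estimate the prevalence of a binary trait from a sample.

    Each respondent is weighted by 1/degree, on the assumption that
    the chance of being recruited grows with the number of contacts.
    This is an illustrative estimator, not necessarily the paper's.
    """
    if len(traits) != len(degrees):
        raise ValueError("traits and degrees must have the same length")
    if not traits:
        raise ValueError("sample must not be empty")
    if any(d <= 0 for d in degrees):
        raise ValueError("every reported degree must be positive")

    weights = [1.0 / d for d in degrees]
    weighted_positive = sum(w * t for w, t in zip(weights, traits))
    return weighted_positive / sum(weights)


# Hypothetical sample: 1 = has the trait, 0 = does not.
traits = [1, 0, 1, 0]
# Hypothetical self-reported numbers of contacts.
degrees = [10, 2, 5, 1]

naive = sum(traits) / len(traits)
weighted = inverse_degree_prevalence(traits, degrees)
print(f"naive sample proportion: {naive:.3f}")
print(f"inverse-degree weighted: {weighted:.3f}")
```

In this made-up sample the respondents with the trait also report the most contacts, so the weighted estimate falls below the raw sample proportion. A weighting of this kind corrects only for degree; it does not address the community-structure and response-rate biases that the paper studies.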

Item Type: Article
Uncontrolled Keywords: Respondent-sampling method, sampling, hard-to-reach population, social networks
Subjects: H Social Sciences > HA Statistics
Faculty / School / Research Centre / Research Group: Faculty of Business
Faculty of Business > Networks and Urban Systems Centre (NUSC) > Centre for Business Network Analysis (CBNA)
Faculty of Business > Department of International Business & Economics
Last Modified: 08 May 2019 23:23
URI: http://gala.gre.ac.uk/id/eprint/19633
