Estimating uncertainty in respondent-driven sampling using a tree bootstrap method

Proc Natl Acad Sci U S A. 2016 Dec 20;113(51):14668-14673. doi: 10.1073/pnas.1617258113. Epub 2016 Dec 7.

Abstract

Respondent-driven sampling (RDS) is a network-based form of chain-referral sampling used to estimate attributes of populations that are difficult to access using standard survey tools. Although it has grown quickly in popularity since its introduction, the statistical properties of RDS estimates remain elusive. In particular, the sampling variability of these estimates has been shown to be much higher than previously acknowledged, and even methods designed to account for RDS result in misleadingly narrow confidence intervals. In this paper, we introduce a tree bootstrap method for estimating uncertainty in RDS estimates based on resampling recruitment trees. We use simulations from known social networks to show that the tree bootstrap method not only outperforms existing methods but also captures the high variability of RDS, even in extreme cases with high design effects. We also apply the method to data from injecting drug users in Ukraine. Unlike other methods, the tree bootstrap depends only on the structure of the sampled recruitment trees, not on the attributes being measured on the respondents, so correlations between attributes can be estimated as well as variability. Our results suggest that it is possible to accurately assess the high level of uncertainty inherent in RDS.
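
The abstract gives no algorithmic detail, so the following is only a minimal sketch of one plausible reading of "resampling recruitment trees". It assumes that seeds are drawn with replacement, that each sampled respondent's recruits are then drawn with replacement recursively down the tree, and that a percentile interval is taken over the replicates. The names `resample_tree` and `tree_bootstrap`, the toy data, and the unweighted sample mean used as the estimator are all hypothetical; an actual RDS analysis would substitute an RDS-specific estimator.

```python
import random
from statistics import mean


def resample_tree(node, children, rng):
    """Return the ids in one bootstrap copy of the subtree rooted at `node`."""
    sampled = [node]
    recruits = children.get(node, [])
    if recruits:
        # Assumption: draw as many recruits as the node originally had,
        # with replacement, then recurse into each drawn recruit.
        for child in rng.choices(recruits, k=len(recruits)):
            sampled.extend(resample_tree(child, children, rng))
    return sampled


def tree_bootstrap(seeds, children, values, n_boot=1000, level=0.95, seed=0):
    """Percentile interval for the mean of `values` over resampled trees."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        sample = []
        # Assumption: resample the seeds (tree roots) with replacement first.
        for s in rng.choices(seeds, k=len(seeds)):
            sample.extend(resample_tree(s, children, rng))
        # Placeholder estimator: unweighted mean of the attribute.
        estimates.append(mean(values[i] for i in sample))
    estimates.sort()
    lo_idx = max(0, int((1 - level) / 2 * n_boot))
    hi_idx = min(n_boot - 1, int((1 + level) / 2 * n_boot) - 1)
    return estimates[lo_idx], estimates[hi_idx]


# Toy recruitment forest (hypothetical data): two seeds, ids 0 and 1.
seeds = [0, 1]
children = {0: [2, 3], 1: [4], 2: [5, 6], 4: [7, 8]}
# Binary attribute for each respondent, e.g. 1 = positive, 0 = negative.
values = {0: 1, 1: 0, 2: 1, 3: 0, 4: 0, 5: 1, 6: 1, 7: 0, 8: 1}

lower, upper = tree_bootstrap(seeds, children, values)
print(f"95% interval for the attribute mean: [{lower:.2f}, {upper:.2f}]")
```

Because each replicate in this sketch is a list of respondent ids rather than attribute values, the same replicates could be reused to compute several attributes at once, which is consistent with the abstract's remark that correlations between attributes can be estimated.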

Keywords: HIV; hard-to-reach population; injecting drug user; snowball sampling; social network.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Adolescent
  • Adolescent Behavior
  • Algorithms
  • Centers for Disease Control and Prevention, U.S.
  • Colorado
  • Computer Simulation
  • Female
  • HIV Infections / epidemiology*
  • HIV Infections / transmission*
  • Heterosexuality
  • Humans
  • Longitudinal Studies
  • Male
  • Models, Statistical
  • Patient Selection*
  • Probability
  • Risk-Taking
  • Schools
  • Sex Workers
  • Sexual Behavior
  • Social Support*
  • Substance Abuse, Intravenous
  • Surveys and Questionnaires
  • Ukraine
  • Uncertainty
  • United States