Dataset Information

Estimating uncertainty in respondent-driven sampling using a tree bootstrap method.


ABSTRACT: Respondent-driven sampling (RDS) is a network-based form of chain-referral sampling used to estimate attributes of populations that are difficult to access using standard survey tools. Although it has grown quickly in popularity since its introduction, the statistical properties of RDS estimates remain elusive. In particular, the sampling variability of these estimates has been shown to be much higher than previously acknowledged, and even methods designed to account for RDS result in misleadingly narrow confidence intervals. In this paper, we introduce a tree bootstrap method for estimating uncertainty in RDS estimates based on resampling recruitment trees. We use simulations from known social networks to show that the tree bootstrap method not only outperforms existing methods but also captures the high variability of RDS, even in extreme cases with high design effects. We also apply the method to data from injecting drug users in Ukraine. Unlike other methods, the tree bootstrap depends only on the structure of the sampled recruitment trees, not on the attributes being measured on the respondents, so correlations between attributes can be estimated as well as variability. Our results suggest that it is possible to accurately assess the high level of uncertainty inherent in RDS.
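The abstract describes the tree bootstrap only as "resampling recruitment trees", so the following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that seeds are resampled with replacement and that, within each drawn tree, each respondent's recruits are resampled with replacement level by level. The function names (`resample_tree`, `tree_bootstrap`, `percentile_interval`), the toy data, and the use of a plain sample mean in place of a proper RDS estimator are all illustrative assumptions.

```python
import random
from statistics import mean


def resample_tree(node, children, rng):
    """Return respondent ids for one bootstrap replicate rooted at `node`.

    Assumption: each respondent's recruits are redrawn with replacement,
    keeping the number of recruits equal to the observed number.
    """
    sampled = [node]
    recruits = children.get(node, [])
    if recruits:
        for child in rng.choices(recruits, k=len(recruits)):
            sampled.extend(resample_tree(child, children, rng))
    return sampled


def tree_bootstrap(seeds, children, values, n_boot=1000, seed=0):
    """Return one estimate per bootstrap replicate.

    The plain mean below is a placeholder for whatever RDS estimator is
    actually in use; it is NOT an RDS-weighted estimate.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        replicate = []
        # Assumption: seeds (tree roots) are also redrawn with replacement.
        for s in rng.choices(seeds, k=len(seeds)):
            replicate.extend(resample_tree(s, children, rng))
        estimates.append(mean(values[i] for i in replicate))
    return estimates


def percentile_interval(estimates, level=0.95):
    """Simple percentile interval from the bootstrap estimates."""
    ordered = sorted(estimates)
    lo_idx = int((1 - level) / 2 * (len(ordered) - 1))
    hi_idx = int((1 + level) / 2 * (len(ordered) - 1))
    return ordered[lo_idx], ordered[hi_idx]


# Hypothetical toy data: two seeds, recruitment links, and a binary attribute.
seeds = ["a", "b"]
children = {"a": ["c", "d"], "c": ["e"], "b": ["f"]}
values = {"a": 1, "b": 0, "c": 1, "d": 0, "e": 1, "f": 0}

estimates = tree_bootstrap(seeds, children, values, n_boot=500, seed=1)
print(percentile_interval(estimates))
```

Because the resampling step uses only the tree structure, the same replicate of respondent ids could be reused to compute several attributes at once, which is consistent with the abstract's remark that correlations between attributes can be estimated.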

SUBMITTER: Baraff AJ 

PROVIDER: S-EPMC5187726 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

Publications

Estimating uncertainty in respondent-driven sampling using a tree bootstrap method.

Aaron J. Baraff, Tyler H. McCormick, Adrian E. Raftery

Proceedings of the National Academy of Sciences of the United States of America, 2016-12-07, issue 51

Similar Datasets

| S-EPMC7228542 | biostudies-literature
| S-EPMC4418439 | biostudies-literature
| S-EPMC2872407 | biostudies-literature
| S-EPMC3277908 | biostudies-literature
| S-EPMC4877136 | biostudies-literature
| S-EPMC3814964 | biostudies-literature
| S-EPMC6788810 | biostudies-literature
| S-EPMC8424528 | biostudies-literature
| S-EPMC1705484 | biostudies-other
| S-EPMC6537868 | biostudies-literature