Journal of the American Statistical Association

Improved Inference for Respondent-Driven Sampling Data With Application to HIV Prevalence Estimation


Abstract

Respondent-driven sampling is a form of link-tracing network sampling, which is widely used to study hard-to-reach populations, often to estimate population proportions. Previous treatments of this process have used a with-replacement approximation, which we show induces bias in estimates for large sample fractions and differential network connectedness by characteristic of interest. We present a treatment of respondent-driven sampling as a successive sampling process. Unlike existing representations, our approach respects the essential without-replacement feature of the process, while converging to an existing with-replacement representation for small sample fractions, and to the sample mean for a full-population sample. We present a successive-sampling based estimator for population means based on respondent-driven sampling data, and demonstrate its superior performance when the size of the hidden population is known. We present sensitivity analyses for unknown population sizes. In addition, we note that like other existing estimators, our new estimator is subject to bias induced by the selection of the initial sample. Using data collected among three populations in two countries, we illustrate the application of this approach to populations with varying characteristics. We conclude that the successive sampling estimator improves on existing estimators, and can also be used as a diagnostic tool when population size is not known. This article has supplementary material online.
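The contrast between without-replacement and with-replacement treatments can be illustrated with a small toy sketch. This is not the article's estimator: it only assumes that successive sampling means drawing units one at a time, without replacement, with probability proportional to degree among the units not yet sampled. The population, the degrees, and the function names (`successive_sample`, `sample_mean`, `inverse_degree_mean`) are hypothetical and chosen for illustration. The inverse-degree weighting stands in for the kind of weights a with-replacement approximation would suggest.

```python
import random

# Hypothetical toy population: 200 units, 50 with the trait (value 1).
# Units with the trait are assumed to be better connected (degree 10 vs 5),
# i.e. differential network connectedness by the characteristic of interest.
N = 200
values = [1] * 50 + [0] * 150
degrees = [10] * 50 + [5] * 150


def successive_sample(degrees, n, rng):
    """Draw n distinct unit indices without replacement; each draw is
    proportional to degree among the units not yet sampled."""
    remaining = list(range(len(degrees)))
    sample = []
    for _ in range(n):
        weights = [degrees[i] for i in remaining]
        pick = rng.choices(range(len(remaining)), weights=weights, k=1)[0]
        sample.append(remaining.pop(pick))
    return sample


def sample_mean(values, sample):
    """Unweighted mean of the sampled values."""
    return sum(values[i] for i in sample) / len(sample)


def inverse_degree_mean(values, degrees, sample):
    """Mean weighted by 1/degree (illustrative with-replacement style weights)."""
    num = sum(values[i] / degrees[i] for i in sample)
    den = sum(1.0 / degrees[i] for i in sample)
    return num / den


rng = random.Random(0)
small = successive_sample(degrees, 20, rng)   # small sample fraction
full = successive_sample(degrees, N, rng)     # full-population sample

print("small sample, unweighted mean:     ", sample_mean(values, small))
print("small sample, inverse-degree mean: ", inverse_degree_mean(values, degrees, small))
print("full sample, unweighted mean:      ", sample_mean(values, full))
print("full sample, inverse-degree mean:  ", inverse_degree_mean(values, degrees, full))
```

In this toy setup, a full-population sample contains every unit, so the unweighted sample mean equals the population proportion, while the inverse-degree weights still down-weight the well-connected group and pull the estimate away from it. That is the large-sample-fraction behavior the abstract attributes to the with-replacement approximation.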
