...
首页> 外文期刊>Statistical papers >Using prior information in privacy-protecting survey designs for categorical sensitive variables
【24h】

Using prior information in privacy-protecting survey designs for categorical sensitive variables

机译:在保护隐私的调查设计中使用先验信息来分类敏感变量

获取原文
获取原文并翻译 | 示例
           

摘要

To gather data on sensitive characteristics, such as annual income, tax evasion, insurance fraud or students' cheating behavior, direct questioning is problematic, because it often results in answer refusal or untruthful responses. For this reason, several randomized response (RR) and nonrandomized response (NRR) survey designs, which increase cooperation by protecting the respondents' privacy, have been proposed in the literature. In the first part of this paper, we present a Bayesian extension of a recently published, innovative NRR method for multichotomous sensitive variables. With this extension, the investigator is able to incorporate prior information on the parameter, e.g., based on a previous study, into the estimation and to improve the estimation precision. In particular, we derive different point and interval estimates by the EM algorithm and data augmentation. The performance of the considered estimators is evaluated in a simulation study. In the second part of this article, we show that for any RR or NRR model addressing the estimation of the distribution of a categorical sensitive characteristic, the design matrices of the model play the central role for the Bayes estimation whereas the concrete answer scheme is irrelevant. This observation enables us to widely generalize the calculations from the first part and to establish a common approach for Bayes inference in RR and NRR designs for categorical sensitive variables. This unified approach covers even multi-stage models and models that require more than one sample.
机译:要收集有关敏感特征(例如年收入,逃税,保险欺诈或学生的作弊行为)的数据,直接提问是有问题的,因为它经常导致回答被拒绝或不真实的回答。因此,文献中提出了几种随机响应(RR)和非随机响应(NRR)调查设计,它们通过保护受访者的隐私来增强合作。在本文的第一部分中,我们提出了针对多变量敏感变量的最新发布的创新NRR方法的贝叶斯扩展。通过此扩展,研究人员能够例如基于先前的研究将关于参数的先前信息合并到估计中,并提高估计精度。特别是,我们通过EM算法和数据扩充来得出不同的点和区间估计。在模拟研究中评估了所考虑的估计器的性能。在本文的第二部分中,我们表明,对于解决类别敏感特征分布估计的任何RR或NRR模型,该模型的设计矩阵对于Bayes估计起着核心作用,而具体的答案方案则无关紧要。 。该观察结果使我们能够广泛地概括第一部分的计算,并为分类敏感变量的RR和NRR设计中的贝叶斯推理建立通用方法。这种统一的方法甚至涵盖了多阶段模型以及需要多个样本的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号