首页> 外文学位 >Bayesian methods for statistical disclosure control in microdata.
【24h】

Bayesian methods for statistical disclosure control in microdata.

机译:用于微数据中统计披露控制的贝叶斯方法。

获取原文
获取原文并翻译 | 示例

摘要

The fundamental tension in statistical disclosure control (SDC) of microdata is the trade-off between the protection of individual respondents and the release of enough information for statistical inferences. We consider microdata that include key variables that contain identifying information and target variables that include sensitive information. Most of the current SDC techniques release a single data set modified from the original to the public and result in biased statistical inferences in the modified data.; I propose two model-based Bayesian SDC methods for disclosure control in microdata, namely, selective multiple imputation of key variables (SMIKe) and multiple stochastic swapping of keys (MASSK). Both techniques release multiple independently modified data sets. The multiplicity of released data allows the incorporation of modification uncertainty into statistical inferences; disclosure risk in released data sets can be controlled to low levels; information loss is limited by the fact that the modification is restricted to the key variables for only a fraction of the total cases. Simulation studies and real data applications are used to evaluate these SDC techniques with respect to disclosure risk, information loss and quality of statistical inferences.
机译:微数据的统计披露控制(SDC)中的基本压力是在保护单个答复者和释放足够的信息以进行统计推断之间进行权衡。我们考虑微数据,这些数据包括包含标识信息的关键变量和包含敏感信息的目标变量。当前大多数的SDC技术都将原始数据修改后的单个数据集发布给公众,并导致修改后的数据中的统计推断有偏差。我提出了两种基于模型的贝叶斯SDC方法来控制微数据中的公开,即密钥的选择性多次插补(SMIKe)和密钥的多次随机交换(MASSK)。两种技术都释放多个独立修改的数据集。所发布数据的多样性允许将修改不确定性纳入统计推断;可以将已发布数据集中的披露风险控制在较低水平;信息丢失受到以下事实的限制:修改仅限于全部案例的一小部分。仿真研究和实际数据应用程序被用来评估这些SDC技术的披露风险,信息损失和统计推断的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号