首页> 外文会议>CASC Project Final Conference on Privacy in Statistical Databases(PSD 2004); 20040609-20040611; Barcelona; ES >A Bayesian Hierarchical Model Approach to Risk Estimation in Statistical Disclosure Limitation
【24h】

A Bayesian Hierarchical Model Approach to Risk Estimation in Statistical Disclosure Limitation

机译:统计披露限制中的贝叶斯层次模型风险估计

获取原文
获取原文并翻译 | 示例

摘要

When microdata files for research are released, it is possible that external users may attempt to breach confidentiality. For this reason most National Statistical Institutes apply some form of disclosure risk assessment and data protection. Risk assessment first requires a measure of disclosure risk to be defined. In this paper we build on previous work by [BF98] to define a Bayesian hierarchical model for risk estimation. We follow a superpopulation approach similar to [BKP90] and [Rin03]. For each combination of values of the key variables we derive the posterior distribution of the population frequency given the observed sample frequency. Knowledge of this posterior distribution enables us to obtain suitable summaries that can be used to estimate the risk of disclosure. One such summary is the mean of the reciprocal of the population frequency or Benedetti-Franconi risk, but we also investigate others such as the mode. We apply our approach to an artificial sample of the Italian 1991 Census data, drawn by means of a widely used sampling scheme. We report on results of this application and document the computational difficulties that we encountered. The risk estimates that we obtain are sensible, but suggest possible improvements and modifications to our methodology. We discuss these together with potential alternative strategies.
机译:发布用于研究的微数据文件时,外部用户可能会尝试破坏机密性。因此,大多数国家统计局都采用某种形式的披露风险评估和数据保护。风险评估首先需要定义披露风险的度量。在本文中,我们基于[BF98]的先前工作来定义用于风险估计的贝叶斯分层模型。我们遵循类似于[BKP90]和[Rin03]的人口过多方法。对于关键变量值的每种组合,我们在给定观察到的采样频率的情况下得出总体频率的后验分布。关于这种后验分布的知识使我们能够获取合适的摘要,以用于估计披露的风险。这样的总结之一就是人口频率或Benedetti-Franconi风险倒数的平均值,但我们还调查了其他诸如模式。我们将我们的方法应用于通过广泛使用的抽样方案绘制的意大利1991年人口普查数据的人工样本。我们报告此应用程序的结果并记录我们遇到的计算困难。我们获得的风险估计是合理的,但建议对我们的方法进行可能的改进和修改。我们将与潜在的替代策略一起讨论这些。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号