首页> 外文会议>Privacy in Statistical Databases; Lecture Notes in Computer Science; 4302 >A Generalized Negative Binomial Smoothing Model for Sample Disclosure Risk Estimation
【24h】

A Generalized Negative Binomial Smoothing Model for Sample Disclosure Risk Estimation

机译:样本披露风险估计的广义负二项式平滑模型

获取原文
获取原文并翻译 | 示例

摘要

We deal with the issue of risk estimation in a sample frequency table to be released by an agency. Risk arises from non-empty sample cells which represent small population cells and from population uniques in particular. Therefore risk estimation requires assessing which of the relevant population cells are indeed small. Various methods have been proposed for this task, and we present a new method in which estimation of a population cell frequency is based on smoothing using a local neighborhood of this cell, that is, cells having similar or close values in all attributes. The statistical model we use is a generalized Negative Binomial model which subsumes the Poisson and Negative Binomial models. We provide some preliminary results and experiments with this method. Comparisons of the new approach are made to a method based on Poisson regression log-linear hierarchical model, in which inference on a given cell is based on classical models of contingency tables. Such models connect each cell to a 'neighborhood' of cells with one or several common attributes, but some other attributes may differ significantly. We also compare to the Argus Negative Binomial method in which inference on a given cell is based only on sampling weights, without learning from any type of 'neighborhood' of the given cell and without making use of the structure of the table.
机译:我们在代理商发布的样本频率表中处理风险估算问题。风险来自代表小种群细胞的非空样本细胞,尤其是种群的唯一性。因此,风险评估需要评估哪些相关种群细胞确实很小。已经提出了用于该任务的各种方法,并且我们提出了一种新方法,其中,基于使用该小区的局部邻域(即,在所有属性中具有相似或接近值的小区)进行平滑,来估计种群小区频率。我们使用的统计模型是广义负二项式模型,它包含了泊松模型和负二项式模型。我们提供了一些使用该方法的初步结果和实验。新方法与基于Poisson回归对数线性分层模型的方法进行了比较,其中对给定单元格的推论基于列联表的经典模型。此类模型将每个像元连接到具有一个或几个通用属性的像元“邻居”,但是某些其他属性可能会显着不同。我们还与阿格斯负二项式方法进行了比较,在该方法中,对给定单元格的推断仅基于采样权重,而无需学习给定单元格的任何“邻域”类型,也无需利用表格的结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号