首页> 外文会议>ACM SIGMOD international conference on management of data >Non-homogeneous Generalization in Privacy Preserving Data Publishing
【24h】

Non-homogeneous Generalization in Privacy Preserving Data Publishing

机译:保留数据发布的隐私概括的非同质概括

获取原文

摘要

Most previous research on privacy-preserving data publishing, based on the fc-anonymity model, has followed the simplistic approach of homogeneously giving the same generalized value in all quasi-identifiers within a partition. We observe that the anonymiza-tion error can be reduced if we follow a non-homogeneous generalization approach for groups of size larger than fc. Such an approach would allow tuples within a partition to take different generalized quasi-identifier values. Anonymization following this model is not trivial, as its direct application can easily violate fc-anonymity. In addition, non-homogeneous generalization allows for additional types of attack, which should be considered in the process. We provide a methodology for verifying whether a non-homogeneous generalization violates fc-anonymity. Then, we propose a technique that generates a non-homogeneous generalization for a partition and show that its result satisfies fc-anonymity, however by straightforwardly applying it, privacy can be compromised if the attacker knows the anonymization algorithm. Based on this, we propose a randomization method that prevents this type of attack and show that fc-anonymity is not compromised by it. Non-homogeneous generalization can be used on top of any existing partitioning approach to improve its utility. In addition, we show that a new partitioning technique tailored for non-homogeneous generalization can further improve quality. A thorough experimental evaluation demonstrates that our methodology greatly improves the utility of anonymized data in practice.
机译:基于FC-匿名模型的基于FC-Anymony模型的最先前关于隐私保留数据发布的研究遵循了分区内所有准标识符中相同的广义值的简单方法。我们观察到,如果我们遵循大小大于FC的统一组的非同质泛化方法,可以减少匿名挑错的错误。这样的方法将允许分区内的元组采用不同的广义准标识符值。在此模型之后的匿名化并不重要,因为它的直接应用程序可以轻松违反FC-Anonyment。此外,非均匀概括允许额外的攻击类型,这应该在该过程中考虑。我们提供了一种验证非同质概念是否违反FC-Ononyment的方法。然后,我们提出了一种技术为分区生成非同质泛化,并表明其结果满足FC - 匿名性,但是通过直接应用它,如果攻击者知道匿名化算法,则可以损害隐私。基于此,我们提出了一种随机化方法,可以防止这种类型的攻击,并表明FC-Anonymity不会受到影响。非均匀泛化可用于任何现有的分区方法,以改善其实用程序。此外,我们表明,为非均匀概括而定制的新分区技术可以进一步提高质量。彻底的实验评估表明我们的方法大大提高了在实践中匿名数据的效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号