首页> 外文期刊>International Journal of Advanced Information Technology >An Efficient Algorithm for Anonymization of Set-Valued Data and Representation Using Fp-Tree
【24h】

An Efficient Algorithm for Anonymization of Set-Valued Data and Representation Using Fp-Tree

机译:集值数据的匿名化和使用Fp-Tree表示的高效算法

获取原文
       

摘要

Data anonymization techniques enable publication of detailed information, while providing the privacy ofsensitive information in the data against a variety of attacks. Anonymized data describes a set of possibleworlds that include the original data. Generalization and suppression have been the most commonly usedtechniques for achieving anonymization. Some algorithms to protect privacy in the publication of set-valued data were developed by Terrovitis et al .,[16]. The concept of k-anonymity was introduced bySamarati and Sweeny [15], so that every tuple has at least (k-1) tuples identical with it. This concept wasmodified in [16] in order to introducemk-anonymity, to limit the effects of the data dimensionality. Thisapproach depends upon generalisation instead of suppression.To handle this problem two heuristicalgorithms; namely the DA-algorithm and the AA-algorithm were developed by them.These alogorithmsprovide near optimal solutions in many cases.In this paper,we improve DA such that undesirableduplicates are not generated andwe can display the anonymized data even in the FP-Tree way.Weillustrate through suitable examples,the efficiency of our proposed algorithm.
机译:数据匿名化技术可以发布详细信息,同时针对各种攻击在数据中提供敏感信息的隐私性。匿名数据描述了包含原始数据的一组可能世界。泛化和抑制是实现匿名化的最常用技术。 Terrovitis等人[16]开发了一些保护集合值数据公开中的隐私的算法。 Samarati和Sweeny [15]引入了k-匿名性的概念,因此每个元组至少具有(k-1)个与之相同的元组。在[16]中对该概念进行了修改,以引入mk匿名性,以限制数据维数的影响。此方法依赖于泛化而不是抑制。要解决此问题,有两种启发式算法;这些算法在许多情况下提供了接近最优的解决方案。在本文中,我们对DA进行了改进,以使得不会生成不希望的重复项,并且即使以FP-Tree方式也可以显示匿名数据。我们通过适当的例子来说明我们提出的算法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号