首页> 外文会议>International Conference on Network-Based Information Systems >An Efficient Algorithm for Incremental Privacy Breach on (k, e)-Anonymous Model
【24h】

An Efficient Algorithm for Incremental Privacy Breach on (k, e)-Anonymous Model

机译:一种有效的算法(k,e) - 匿名模型的增量隐私违规算法

获取原文

摘要

Collaboration between business partners have become crucial these days. An important issue to be addressed is data privacy. In this paper, we address a problem of data privacy based on a prominent privacy model, (k, e)-Anonymous, when a new dataset is to be released, meanwhile there might be existing datasets released elsewhere. Since some attackers might obtain multiple versions of the datasets and compare them with the newly released dataset. Though, the privacy of all the datasets have been well-preserved individually, such comparison can lead to an privacy breach. We study the characteristics of the effects of multiple dataset releasing theoretically. It has been found that the privacy breach subjected to the increment occurs when there exists overlapping between any partition of the new dataset with any partition of any existing dataset. Based on our proposed studies, a polynomial time algorithm is proposed. Not only it needs only considering one previous version of the dataset, it also can skip computing the overlapping partitions. Thus, the computational complexity of the proposed algorithm is only O(pn3) where p is the number of partitions and n is the number of tuples, meanwhile the privacy of all released datasets as well as the optimal solution can be always guaranteed. In addition, the experiments results, which can illustrate the efficiency of our algorithm, on the real-world dataset is presented.
机译:这些天商业伙伴之间的合作变得至关重要。要解决的一个重要问题是数据隐私。在本文中,我们根据突出的隐私模型解决了数据隐私问题,(k,e) - 当要发布新数据集时,同时可能存在其他地方发布的现有数据集。由于某些攻击者可能获取多个版本的数据集并将它们与新发布的数据集进行比较。虽然,所有数据集的隐私都是完全保存的,这样的比较可以导致隐私违约。我们研究了大学理论上释放多个数据集的影响的特征。已经发现,当使用任何现有数据集的任何分区的新数据集的任何分区之间存在重叠时,发生递增的隐私泄露。基于我们所提出的研究,提出了一种多项式时间算法。不仅需要考虑一个以前的数据集版,它也可以跳过计算重叠分区。因此,所提出的算法的计算复杂性仅是O(pn 3 ),其中p是分区的数量,n是元组的数量,同时所有释放的数据集的隐私以及最佳可以始终保证解决方案。此外,实验结果可以说明我们算法的效率,在现实世界数据集上呈现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号