...
首页> 外文期刊>Computers and Electrical Engineering >An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
【24h】

An incremental privacy-preservation algorithm for the (k, e)-Anonymous model

机译:(k,e)-匿名模型的增量隐私保护算法

获取原文
获取原文并翻译 | 示例
           

摘要

An important issue to be addressed when data are to be published is data privacy. In this paper, the problem of data privacy based on a prominent privacy model, (k, e)-Anonymous, is addressed. Our scenario is that when a new dataset is to be released, there may be, at the same time, datasets that were released elsewhere. A problem arises because some attackers might obtain multiple versions of the same dataset and compare them with the newly released dataset. Although the privacy of all of the datasets has been well-preserved individually, such a comparison can lead to a privacy breach, which is a so-called "incremental privacy breach". To address this problem effectively, we first study the characteristics the effects of multiple dataset releases with a theoretical approach. It has been found that a privacy breach that is subjected to an increment occurs when there is overlap between any parts of the new dataset with any parts of an existing dataset. Based on our proposed studies, a polynomial-time algorithm is proposed. This algorithm needs to consider only one previous version of the dataset, and it can also skip computing the overlapping partitions. Thus, the computational complexity of the proposed algorithm is reduced from O(n(m)) to only 0(pn(3)) where p is the number of partitions, n is the number of tuples, and m is the number of released datasets. At the same time, the privacy of all of the released datasets as well as the optimal solution can be always guaranteed. In addition, experiment results that illustrate the efficiency of our algorithm on real-world datasets are presented. (C) 2014 Elsevier Ltd. All rights reserved.
机译:要发布数据时要解决的重要问题是数据隐私。在本文中,解决了基于突出的隐私模型(k,e)-Anonymous的数据隐私问题。我们的场景是,当要发布新的数据集时,可能同时存在在其他地方发布的数据集。出现问题是因为某些攻击者可能获得同一数据集的多个版本,并将它们与新发布的数据集进行比较。尽管所有数据集的隐私都得到了单独妥善保存,但这样的比较可能会导致隐私泄露,这就是所谓的“增量隐私泄露”。为了有效解决这个问题,我们首先使用理论方法研究多个数据集发布的影响特征。已经发现,当新数据集的任何部分与现有数据集的任何部分之间存在重叠时,都会发生侵犯隐私的行为。在此基础上,提出了多项式时间算法。该算法只需要考虑数据集的一个先前版本,它也可以跳过计算重叠分区的过程。因此,提出的算法的计算复杂度从O(n(m))降低到仅0(pn(3)),其中p是分区数,n是元组数,m是释放数数据集。同时,可以始终保证所有已发布数据集的私密性以及最佳解决方案。此外,还提供了表明我们的算法在现实世界数据集上的效率的实验结果。 (C)2014 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号