Limiting Privacy Breaches in Privacy Preserving Data Mining

Abstract

There has been increasing interest in the problem of building accurate data mining models over aggregate data, while protecting privacy at the level of individual records. One approach to this problem is to randomize the values in individual records, and only disclose the randomized values. The model is then built over the randomized data, after first compensating for the randomization (at the aggregate level). This approach is potentially vulnerable to privacy breaches: based on the distribution of the data, one may be able to learn with high confidence that some of the randomized records satisfy a specified property, even though privacy is preserved on average. In this paper, we present a new formulation of privacy breaches, together with a methodology, "amplification", for limiting them. Unlike earlier approaches, amplification makes it possible to guarantee limits on privacy breaches without any knowledge of the distribution of the original data. We instantiate this methodology for the problem of mining association rules, and modify the algorithm from [9] to limit privacy breaches without knowledge of the data distribution. Next, we address the problem that the amount of randomization required to avoid privacy breaches (when mining association rules) results in very long transactions. By using pseudorandom generators and carefully choosing seeds such that the desired items from the original transaction are present in the randomized transaction, we can send just the seed instead of the transaction, resulting in a dramatic drop in communication and storage cost. Finally, we define new information measures that take privacy breaches into account when quantifying the amount of privacy preserved by randomization.
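The randomize-then-compensate idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm for association rules: it assumes a single binary attribute per record and a simple bit-flip operator, and the names `randomize_bit`, `estimate_fraction`, `amplification_ratio`, and `p_keep` are hypothetical. The sketch also assumes that amplification is measured as the worst-case ratio of transition probabilities p[x1 -> y] / p[x2 -> y], which the abstract itself does not spell out.

```python
import random
from typing import Sequence


def randomize_bit(x: int, p_keep: float, rng: random.Random) -> int:
    """Disclose the true bit with probability p_keep, otherwise its flip."""
    return x if rng.random() < p_keep else 1 - x


def estimate_fraction(randomized: Sequence[int], p_keep: float) -> float:
    """Compensate for the randomization at the aggregate level.

    If f is the true fraction of 1s, the expected observed fraction is
    p_keep * f + (1 - p_keep) * (1 - f). Solving for f gives the estimate.
    Requires p_keep != 0.5, otherwise the output carries no information.
    """
    observed = sum(randomized) / len(randomized)
    return (observed - (1 - p_keep)) / (2 * p_keep - 1)


def amplification_ratio(p_keep: float) -> float:
    """Worst-case ratio p[x1 -> y] / p[x2 -> y] for the bit-flip operator.

    Assumed measure of amplification (not stated in the abstract).
    It depends only on the operator, not on the data distribution.
    Requires 0 < p_keep < 1.
    """
    return max(p_keep, 1 - p_keep) / min(p_keep, 1 - p_keep)


if __name__ == "__main__":
    rng = random.Random(0)                 # fixed seed for repeatability
    original = [1] * 3000 + [0] * 7000     # toy data, true fraction 0.3
    disclosed = [randomize_bit(x, 0.8, rng) for x in original]
    print("estimated fraction:", estimate_fraction(disclosed, 0.8))
    print("amplification ratio:", amplification_ratio(0.8))
```

The point of the last function is that the bound is computed from the randomization operator alone, which mirrors the abstract's claim that limits on privacy breaches can be guaranteed without knowing the distribution of the original data.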

Bibliographic details

Source
ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems | 2003 | 12 pages
Authors
Alexandre Evfimievski; Johannes Gehrke; Ramakrishnan Srikant
Original format: PDF
Chinese Library Classification: TP311.13

Similar literature

1. Reversible privacy preserving data mining: a combination of difference expansion and privacy preserving [J]. Tung-Shou Chen, Wei-Bin Lee, Jeanne Chen. Journal of Supercomputing. 2013, No. 2
2. Managing privacy of sensitive attributes using fuzzy-based data transformation methods in privacy preserving data mining environment [J]. V.K. Saxena, Shashank Pushkar. International Journal of Business Information Systems. 2019, No. 2
3. Rings for Privacy: An Architecture for Large Scale Privacy-Preserving Data Mining [J]. Maria Luisa Merani, Daniele Croce, Ilenia Tinnirello. IEEE Transactions on Parallel and Distributed Systems. 2021, No. 6
4. Limiting Privacy Breaches in Privacy Preserving Data Mining [C]. Alexandre Evfimievski, Johannes Gehrke, Ramakrishnan Srikant. ACM (Association for Computing Machinery) SIGMOD (Special Interest Group for the Management of Data)-SIGACT (Special Interest Group for Algorithms and Computation Theory)-SIGART (Special Interest Group for Artificial Intelligence) Symposium on Principles of . 2003
5. A Utility-Aware Privacy Preserving Framework For Distributed Data Mining With Worst Case Privacy Guarantee [D]. Banerjee, Madhushri. 2011
6. An efficient reversible privacy-preserving data mining technology over data streams [O]. Chen-Yi Lin, Yuan-Hung Kao, Wei-Bin Lee
7. Limiting Privacy Breaches in Privacy Preserving Data Mining [O]. Alexandre Evfimievski, Johannes Gehrke, Ramakrishnan Srikant. 2003
