A Condensation Approach to Privacy Preserving Data Mining

机译：一种浓缩隐私保护数据挖掘的方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, privacy preserving data mining has become an important problem because of the large amount of personal data which is tracked by many business applications. In many cases, users are unwilling to provide personal information unless the privacy of sensitive information is guaranteed. In this paper, we propose a new framework for privacy preserving data mining of multi-dimensional data. Previous work for privacy preserving data mining uses a perturbation approach which reconstructs data distributions in order to perform the mining. Such an approach treats each dimension independently and therefore ignores the correlations between the different dimensions. In addition, it requires the development of a new distribution based algorithm for each data mining problem, since it does not use the multi-dimensional records, but uses aggregate distributions of the data as input. This leads to a fundamental re-design of data mining algorithms. In this paper, we will develop a new and flexible approach for privacy preserving data mining which does not require new problem-specific algorithms, since it maps the original data set into a new anonymized data set. This anonymized data closely matches the characteristics of the original data including the correlations among the different dimensions. We present empirical results illustrating the effectiveness of the method.

机译：近年来，由于许多业务应用程序跟踪的大量个人数据，保护隐私的数据挖掘已成为一个重要问题。在许多情况下，除非敏感信息的私密性得到保证，否则用户不愿提供个人信息。在本文中，我们提出了一个用于多维数据隐私保护数据挖掘的新框架。先前的隐私保护数据挖掘工作使用一种扰动方法，该方法可以重建数据分布以执行挖掘。这种方法独立地对待每个维度，因此忽略了不同维度之间的相关性。另外，由于它不使用多维记录，而是使用数据的聚合分布作为输入，因此需要针对每个数据挖掘问题开发一种基于分布的新算法。这导致对数据挖掘算法的根本重新设计。在本文中，我们将开发一种新的灵活的隐私保护数据挖掘方法，该方法不需要新的特定于问题的算法，因为它将原始数据集映射到新的匿名数据集。该匿名数据紧密匹配原始数据的特征，包括不同维度之间的相关性。我们提供的经验结果说明了该方法的有效性。

著录项

来源
《International Conference on Extending Database Technology(EDBT 2004); 20040314-20040318; Heraklion; GR》|2004年|P.183-199|共17页
会议地点 Crete(GR);Crete(GR)
作者
Cham C. Aggarwal; Philip S. Yu;
展开▼
作者单位

IBM T. J. Watson Research Center, 19 Skyline Drive, Hawthorne, NY 10532;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类各种专用数据库;
关键词
入库时间 2022-08-26 14:11:46

相似文献

外文文献
中文文献
专利

1. On Static and Dynamic Methods for Condensation-Based Privacy-Preserving Data Mining [J] . CHARU C. AGGARWAL, PHILIP S. YU ACM transactions on database systems . 2008,第1期

机译：基于凝聚的隐私保护数据挖掘的静态和动态方法研究
2. Preliminary Data Analysis in Healthcare Multicentric Data Mining: a Privacy-preserving Distributed Approach [J] . Andrea Damiani, Carlotta Masciocchi, Luca Boldrini, Je-LKS . 2018,第1期

机译：医疗保健多中心数据挖掘的初步数据分析：一种隐私保留分布式方法
3. Improving the implementation of new approach data privacy preserving in data mining using slicing [J] . Ravindra S. Wanjari, Prof. Devi Kalpna International Journal of Engineering Research and Applications . 2013,第4期

机译：使用切片改进在数据挖掘中保护数据隐私的新方法的实现
4. A Condensation Approach to Privacy Preserving Data Mining [C] . Charu C. Aggarwal, Philip S. Yu International Conference on Extending Database Technology . 2004

机译：隐私保留数据挖掘的凝结方法
5. A Utility-Aware Privacy Preserving Framework For Distributed Data Mining With Worst Case Privacy Guarantee. [D] . Banerjee, Madhushri. 2011

机译：一个实用程序感知的隐私保护框架，用于具有最坏情况隐私保证的分布式数据挖掘。
6. An efficient reversible privacy-preserving data mining technology over data streams [O] . Chen-Yi Lin, Yuan-Hung Kao, Wei-Bin Lee, -1

机译：高效的可逆数据隐私保护数据挖掘技术
7. A Condensation Approach to Privacy Preserving Data Mining [O] . Charu Aggarwal And, Charu C. Aggarwal, Philip S. Yu 2004

机译：隐私保护数据挖掘的一种冷凝方法

A Condensation Approach to Privacy Preserving Data Mining

摘要

著录项

相似文献

相关主题

期刊订阅