...
首页> 外文期刊>Intelligent data analysis >A privacy preserving clustering technique for horizontally and vertically distributed datasets
【24h】

A privacy preserving clustering technique for horizontally and vertically distributed datasets

机译:水平和垂直分布数据集的隐私保护聚类技术

获取原文
获取原文并翻译 | 示例
           

摘要

Despite the benefits of data mining in a wide range of applications, this technique has raised some issues related to the privacy and security of individuals. Due to these issues, data owners may prevent to share their sensitive information with data miners. On the other hand, in distributed environments, other issues related to the distribution of data will raise, which will make the preservation of privacy more challengeable. To solve these problems, different privacy preserving data mining (PPDM) techniques have been introduced. In this paper, a new privacy preserving clustering (PPC) technique for horizontally and vertically distributed datasets is proposed. The proposed technique uses Haar wavelet transforms (HWT) and scaling data perturbation (SDP) to achieve both data hiding and data reduction for protecting private numerical attribute values in distributed datasets. The results of our evaluations demonstrated that the proposed technique provides a proper degree of privacy and quality of clustering for distributed datasets and also runs fast. Our experiments have also shown that the proposed technique provides better privacy and clustering results comparing to the other existing privacy preserving clustering techniques applicable to distributed datasets. The proposed algorithms and the results of their experimental evaluations using different datasets are presented in this paper.
机译:尽管数据挖掘在许多应用程序中都有好处,但该技术提出了一些与个人隐私和安全性有关的问题。由于这些问题,数据所有者可能会阻止与数据挖掘者共享其敏感信息。另一方面,在分布式环境中,与数据分发有关的其他问题将会提出,这将使隐私保护更具挑战性。为了解决这些问题,已经引入了不同的隐私保护数据挖掘(PPDM)技术。本文提出了一种新的针对水平和垂直分布数据集的隐私保护聚类(PPC)技术。所提出的技术使用Haar小波变换(HWT)和缩放数据扰动(SDP)来实现数据隐藏和数据归约,以保护分布式数据集中的私有数值属性值。我们的评估结果表明,所提出的技术为分布式数据集提供了适当程度的隐私和聚类质量,并且运行速度很快。我们的实验还表明,与适用于分布式数据集的其他现有隐私保护聚类技术相比,所提出的技术提供了更好的隐私和聚类结果。本文介绍了所提出的算法以及使用不同数据集进行实验评估的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号