首页> 外文会议>2013 IEEE International Conference on Control, Automation, Robotics and Embedded Systems >Privacy preserving processing of high dimensional data classification based on sample selection and Singular Value Decomposition
【24h】

Privacy preserving processing of high dimensional data classification based on sample selection and Singular Value Decomposition

机译:基于样本选择和奇异值分解的高维数据分类隐私保护处理

获取原文
获取原文并翻译 | 示例

摘要

With the development of data mining technologies, privacy protection has become a challenge for data mining applications in many fields. To solve this problem, many privacy-preserving data mining methods have been proposed. One important type of such methods is based on Singular Value Decomposition (SVD). In the proposed algorithm, attributes are grouped according to their distance difference similarity by clustering the data set using decision tree classification. Secondly, the algorithm packetizes the attributes according to their SA value in each group. Thirdly, for each group it selects attributes from the smallest bucket and searches for a similar attributes in the attributes-1 largest buckets from the same group to create an equivalence class following the unique attribute-distinct diversity anonymization model. The proposed algorithm satisfies the “utility based anonymization principle that crucial information is protected from being suppressed. Also, weights given to attributes improve clustering and give the ability to control the generalization's depth. In prototype classification is combination of clustering and classification technique such methods are called ensemble classifier, this new proposed method is more efficient in balancing data privacy and data utility.
机译:随着数据挖掘技术的发展,隐私保护已成为许多领域中数据挖掘应用程序的挑战。为了解决这个问题,已经提出了许多保护隐私的数据挖掘方法。这种方法的一种重要类型是基于奇异值分解(SVD)。在提出的算法中,通过使用决策树分类对数据集进行聚类,根据属性的距离差异相似度对属性进行分组。其次,该算法根据属性在每个组中的SA值对数据包进行打包。第三,对于每个组,它从最小的存储桶中选择属性,并在同一组的attributes-1最大存储桶中搜索相似的属性,以遵循唯一的属性-不同的多样性匿名化模型创建等效类。所提出的算法满足“基于实用程序的匿名化原则,即关键信息受到保护而不会被抑制。同样,赋予属性的权重可改善聚类,并具有控制概括深度的能力。在原型分类中,聚类和分类技术相结合,这种方法称为集成分类器,该新提出的方法在平衡数据隐私和数据实用性方面更为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号