...
首页> 外文期刊>Indian Journal of Science and Technology >Privacy Preserving Data Mining for Ordinal Data using Correlation Based Transformation Strategy (CBTS)
【24h】

Privacy Preserving Data Mining for Ordinal Data using Correlation Based Transformation Strategy (CBTS)

机译:使用基于相关的转换策略(CBTS)对序数数据进行隐私保护的数据挖掘

获取原文
           

摘要

Objectives: Preservation of privacy is a significant aspect of data mining. The main objective of PPDM is to hide or provide privacy to certain sensitive information so that they can be protected from unauthorized parties or intruders. Methods/ Statistical Analysis: Though privacy is achieved by hiding the sensitive or private data, it will affect the data mining algorithms in knowledge extraction, so an effective method or strategy is required to provide privacy to the data and simultaneously protecting the quality of data mining algorithms. Instead of removing or encrypting sensitive or private data, we make use of data transformation strategies that keep the statistical, semantic and heuristic nature of data while protecting the sensitive or private data. Findings: In this paper we studied the technical feasibility of realizing Privacy Preserving Data Mining. In the proposed work, Correlation Based Transformation Strategy for Privacy Preserving Data Mining is used for ordinal data. We apply the method on few datasets namely soybean, Breast Cancer, Nursery dataset and Car dataset. We tabulate the end results applying the proposed strategy on both the original and the transformed dataset and observe correlation difference, Information Entropy and Classification Accuracy with different machine learning algorithms and Clustering Quality. Application/Improvements: As an improvement, the proposed work can be extended by use of vector marking techniques where these techniques help in increasing the efficiency by avoiding unauthorised access to the information.
机译:目标:保护隐私是数据挖掘的重要方面。 PPDM的主要目的是隐藏或提供某些敏感信息的隐私,以便可以保护它们免受未授权方或入侵者的侵害。方法/统计分析:尽管隐私是通过隐藏敏感或私有数据来实现的,但它会影响知识提取中的数据挖掘算法,因此需要一种有效的方法或策略来为数据提供隐私并同时保护数据挖掘的质量算法。我们没有使用删除或加密敏感或私有数据的方法,而是使用数据转换策略,这些策略在保护敏感或私有数据的同时保持数据的统计,语义和启发式性质。发现:本文研究了实现隐私保护数据挖掘的技术可行性。在拟议的工作中,用于隐私保护数据挖掘的基于相关性的转换策略用于序数据。我们将该方法应用于大豆,乳腺癌,苗圃数据和汽车数据集等少数数据集。我们将提议的策略应用于原始数据集和转换后的数据集,将最终结果制成表格,并观察不同机器学习算法和聚类质量的相关差异,信息熵和分类精度。应用/改进:作为改进,可以通过使用矢量标记技术来扩展建议的工作,其中这些技术可通过避免未经授权的信息访问来帮助提高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号