...
【24h】

The GA-based algorithms for optimizing hiding sensitive itemsets through transaction deletion

机译:基于GA的算法,用于通过事务删除优化隐藏敏感项目集

获取原文
获取原文并翻译 | 示例
           

摘要

Data mining technology is used to extract useful knowledge from very large datasets, but the process of data collection and data dissemination may result in an inherent threat to privacy. Some sensitive or private information concerning individuals, businesses and organizations has to be suppressed before it is shared or published. Privacy-preserving data mining (PPDM) has become an important issue in recent years. In the past, many heuristic approaches were developed to sanitize databases for the purpose of hiding sensitive information in PPDM, but data sanitization of PPDM is considered to be an NP-hard problem. It is critical to find the balance between privacy protection for hiding sensitive information and maintaining the discovery of knowledge, or even reducing artificial knowledge in the sanitization process. In this paper, a GA-based framework with two optimization algorithms is proposed for data sanitization. A novel evaluation function with three concerned factors is designed to find the appropriate transactions to be deleted in order to hide sensitive itemsets. Experiments are then conducted to evaluate the performance of the proposed GA-based algorithms with regard to different factors such as the execution time, the number of hiding failures, the number of missing itemsets, the number of artificial itemsets, and database dissimilarity.
机译:数据挖掘技术用于从非常大的数据集中提取有用的知识,但是数据收集和数据分发的过程可能会对隐私产生固有的威胁。在共享或发布之前,必须禁止某些有关个人,企业和组织的敏感或私人信息。近年来,保护隐私的数据挖掘(PPDM)已成为一个重要问题。过去,为了在PPDM中隐藏敏感信息而开发了许多启发式方法来清理数据库,但是PPDM的数据清理被认为是一个NP难题。在隐私保护(用于隐藏敏感信息)与维护知识的发现,甚至在消毒过程中减少人工知识之间找到平衡,至关重要。本文提出了一种基于GA的框架,其中包含两种优化算法,用于数据清理。设计了具有三个相关因素的新颖评估功能,以找到要删除的适当交易,以隐藏敏感项目集。然后进行实验,以评估所提出的基于GA的算法在不同因素方面的性能,例如执行时间,隐藏失败的数量,缺失项集的数量,人工项集的数量和数据库的相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号