...
【24h】

Hiding sensitive itemsets without side effects

机译:隐藏敏感的项目集,没有副作用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Data mining techniques are being used to discover useful patterns hidden in the data. However, these data mining techniques also extract sensitive information posing a threat to privacy. Frequent Itemset mining is a widely used data mining technique and a pre-processing step for Association Rule Mining. These frequent itemsets may contain sensitive itemsets which need to be hidden from adversaries. Traditional data sanitization techniques modify transactions in the database to hide sensitive itemsets which suffer from undesired side effects and information loss. In this paper, we propose a pattern sanitization approach to hide sensitive itemsets for privacy preserved pattern sharing. The transactional database is modeled as a set of lossless compact patterns using Closed Itemsets. The novelty of the proposed technique is in sanitizing the closed itemsets/patterns instead of transactions in the database. The proposed Recursive Pattern Sanitization (RPS) algorithm hides multiple sensitive itemsets irrespective of their size and support in single parse of the closed patterns. The patterns in the sanitized model retain the closeness property, and the model has inherent support for finding frequent itemsets and association rules reducing mining activity by the end user. Experimental results show that the proposed approach is effective in hiding sensitive itemsets without side effects and unexpected information loss compared to other well-known transaction modification based itemset hiding techniques.
机译:数据挖掘技术用于发现隐藏在数据中的有用模式。然而,这些数据挖掘技术还提取了对隐私威胁构成威胁的敏感信息。频繁的项目集挖掘是一个广泛使用的数据挖掘技术和用于关联规则挖掘的预处理步骤。这些频繁的项目集可能包含需要从对手隐藏的敏感项集。传统数据消毒技术修改数据库中的事务以隐藏敏感的项目集,这些项目集遭受不希望的副作用和信息丢失。在本文中,我们提出了一种模式消毒方法来隐藏隐私保留模式共享的敏感项集。交易数据库使用封闭项目设置为一组无损紧凑型图案。所提出的技术的新颖性是在消毒封闭的项目集/模式而不是数据库中的事务。所提出的递归模式消毒(RPS)算法隐藏了多个敏感项,而无论它们的尺寸和封闭式图案的单个解析都是如此。 Sanitized模型中的模式保留了近距离属性,并且该模型具有固有的支持,用于查找频繁的项目集和关联规则通过最终用户降低挖掘活动。实验结果表明,与其他基于事务修改的项目集隐藏技术相比,该方法在没有副作用和意外信息丢失的敏感项集中有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号