...
首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >An Improved Sanitization Algorithm in Privacy-Preserving Utility Mining
【24h】

An Improved Sanitization Algorithm in Privacy-Preserving Utility Mining

机译:一种改进的保护实用程序挖掘算法

获取原文

摘要

High-utility pattern mining is an effective technique that extracts significant information from varied types of databases. However, the analysis of data with sensitive private information may cause privacy concerns. To achieve better trade-off between utility maximizing and privacy preserving, privacy-preserving utility mining (PPUM) has become an important research topic in recent years. The MSICF algorithm is a sanitization algorithm for PPUM. It selects the item based on the conflict count and identifies the victim transaction based on the concept of utility. Although MSICF is effective, the heuristic selection strategy can be improved to obtain a lower ratio of side effects. In our paper, we propose an improved sanitization approach named the Improved Maximum Sensitive Itemsets Conflict First Algorithm (IMSICF) to address this issue. It dynamically calculates conflict counts of sensitive items in the sanitization process. In addition, IMSICF chooses the transaction with the minimum number of nonsensitive itemsets and the maximum utility in a sensitive itemset for modification. Extensive experiments have been conducted on various datasets to evaluate the effectiveness of our proposed algorithm. The results show that IMSICF outperforms other state-of-the-art algorithms in terms of minimizing side effects on nonsensitive information. Moreover, the influence of correlation among itemsets on various sanitization algorithms’ performance is observed.
机译:高实用模式挖掘是一种有效的技术,可以从各种类型的数据库中提取重要信息。但是,具有敏感私人信息的数据分析可能导致隐私问题。为了在公用事业最大化和隐私保存之间实现更好的权衡,近年来,隐私保存的公用事业挖掘(PPUM)已成为一个重要的研究主题。 MSICF算法是PPUM的消毒算法。它根据冲突计数选择该项目,并根据实用程序的概念标识受害者事务。虽然MSICF是有效的,但可以提高启发式选择策略以获得较低的副作用比例。在我们的论文中,我们提出了一种改进的消毒方法,命名为改进的最大敏感项目集冲突第一个算法(IMSICF)来解决这个问题。它动态计算了消毒过程中敏感项目的冲突计数。此外,IMSICF在敏感项目集中选择具有最小非敏感项集和最大实用程序的事务,以进行修改。已经在各种数据集中进行了广泛的实验,以评估我们所提出的算法的有效性。结果表明,在最小化对非敏信息的副作用方面,IMSICF在其他最新的算法方面优于其他最先进的算法。此外,观察了项目集之间的相关性对各种待遇算法的性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号