...
首页> 外文期刊>Computers & Security >Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction
【24h】

Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction

机译:隐私保留频繁项目集挖掘:基于数据库重建的最大化数据实用程序

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The process of frequent itemset mining (FIM) within large-scale databases plays a significant part in many knowledge discovery tasks, where, however, potential privacy breaches are possible. Privacy preserving frequent itemset mining (PPFIM) has thus drawn increasing attention recently, where the ultimate goal is to hide sensitive frequent itemsets (SFIs) so as to leave no confidential knowledge uncovered in the resulting database. Nevertheless, the vast majority of the proposed methods for PPFIM were merely based on database perturbation, which may result in a significant loss of data utility in order to conceal all SFIs. To alleviate this issue, this paper proposes a database reconstruction-based algorithm for PPFIM (DR-PPFIM) that can not only achieve a high degree of privacy but also afford a reasonable data utility. In DR-PPFIM, all SFIs with related frequent itemsets are first identified for removing in the pre-sanitize process by implementing a devised sanitize method. With the remained frequent itemsets, a novel database reconstruction scheme is proposed to reconstruct an appropriate database, where the concepts of inverse frequent itemset mining (IFIM) and database extension are efficiently integrated. In this way, all SFIs are able to be hidden under the same mining threshold while maximizing the data utility of the synthetic database as much as possible. Moreover, we also develop a further hiding strategy in DRPPFIM to further decrease the significance of SFIs with the purpose of reducing the risk of disclosing confidential knowledge. Extensive comparative experiments are conducted on real databases to demonstrate the superiority of DR-PPFIM in terms of maximizing the utility of data and resisting potential threats. (C) 2019 Published by Elsevier Ltd.
机译:在大规模数据库中频繁的项目集(FIM)的过程在许多知识发现任务中起重要部分,但是,潜在的隐私漏洞是可能的。保密保留频繁的项目集挖掘(PPFIM)最近引起了越来越多的关注,其中最终目标是隐藏敏感的频繁项目集(SFI),以便在生成的数据库中揭开没有机密知识。尽管如此,绝大多数用于PPFIM的提出方法仅基于数据库扰动,这可能导致数据效用的显着损失,以隐藏所有SFI。为了缓解此问题,本文提出了一种基于数据库重建的PPFIM(DR-PPFIM)算法,其不仅可以实现高度隐私,而且提供合理的数据实用程序。在DR-PPFIM中,首先识别具有相关频繁项集的所有SFI,用于通过实现设计设计的消毒方法来删除预清化过程中。凭借剩余的频繁项目集,建议进行新的数据库重建方案来重建适当的数据库,其中逆频繁项目集挖掘(IFIM)和数据库扩展的概念有效地集成。以这种方式,所有SFI都能够在相同的挖掘阈值下隐藏,同时尽可能最大化合成数据库的数据实用程序。此外,我们还在DRPPFIM中开发了进一步隐藏的策略,以进一步降低SFI的重要性,以降低披露机密知识的风险。广泛的比较实验是在实际数据库上进行的,以展示DR-PPFIM的优越性,以最大化数据的效用和抵制潜在威胁。 (c)2019年由elestvier有限公司发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号