Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction

Li Shaoxin; Mu Nankun; Le Junqing; Liao Xiaofeng

首页> 外文期刊>Computers & Security >Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction

【24h】

Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction

机译：隐私保留频繁项目集挖掘：基于数据库重建的最大化数据实用程序

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The process of frequent itemset mining (FIM) within large-scale databases plays a significant part in many knowledge discovery tasks, where, however, potential privacy breaches are possible. Privacy preserving frequent itemset mining (PPFIM) has thus drawn increasing attention recently, where the ultimate goal is to hide sensitive frequent itemsets (SFIs) so as to leave no confidential knowledge uncovered in the resulting database. Nevertheless, the vast majority of the proposed methods for PPFIM were merely based on database perturbation, which may result in a significant loss of data utility in order to conceal all SFIs. To alleviate this issue, this paper proposes a database reconstruction-based algorithm for PPFIM (DR-PPFIM) that can not only achieve a high degree of privacy but also afford a reasonable data utility. In DR-PPFIM, all SFIs with related frequent itemsets are first identified for removing in the pre-sanitize process by implementing a devised sanitize method. With the remained frequent itemsets, a novel database reconstruction scheme is proposed to reconstruct an appropriate database, where the concepts of inverse frequent itemset mining (IFIM) and database extension are efficiently integrated. In this way, all SFIs are able to be hidden under the same mining threshold while maximizing the data utility of the synthetic database as much as possible. Moreover, we also develop a further hiding strategy in DRPPFIM to further decrease the significance of SFIs with the purpose of reducing the risk of disclosing confidential knowledge. Extensive comparative experiments are conducted on real databases to demonstrate the superiority of DR-PPFIM in terms of maximizing the utility of data and resisting potential threats. (C) 2019 Published by Elsevier Ltd.

机译：在大规模数据库中频繁的项目集（FIM）的过程在许多知识发现任务中起重要部分，但是，潜在的隐私漏洞是可能的。保密保留频繁的项目集挖掘（PPFIM）最近引起了越来越多的关注，其中最终目标是隐藏敏感的频繁项目集（SFI），以便在生成的数据库中揭开没有机密知识。尽管如此，绝大多数用于PPFIM的提出方法仅基于数据库扰动，这可能导致数据效用的显着损失，以隐藏所有SFI。为了缓解此问题，本文提出了一种基于数据库重建的PPFIM（DR-PPFIM）算法，其不仅可以实现高度隐私，而且提供合理的数据实用程序。在DR-PPFIM中，首先识别具有相关频繁项集的所有SFI，用于通过实现设计设计的消毒方法来删除预清化过程中。凭借剩余的频繁项目集，建议进行新的数据库重建方案来重建适当的数据库，其中逆频繁项目集挖掘（IFIM）和数据库扩展的概念有效地集成。以这种方式，所有SFI都能够在相同的挖掘阈值下隐藏，同时尽可能最大化合成数据库的数据实用程序。此外，我们还在DRPPFIM中开发了进一步隐藏的策略，以进一步降低SFI的重要性，以降低披露机密知识的风险。广泛的比较实验是在实际数据库上进行的，以展示DR-PPFIM的优越性，以最大化数据的效用和抵制潜在威胁。（c）2019年由elestvier有限公司发布

著录项

来源
《Computers & Security》 |2019年第7期|17-34|共18页
作者
Li Shaoxin; Mu Nankun; Le Junqing; Liao Xiaofeng;
展开▼
作者单位

Southwest Univ Chongqing Key Lab Nonlinear Circuits & Intelligen Chongqing 400715 Peoples R China;

Southwest Univ Chongqing Key Lab Nonlinear Circuits & Intelligen Chongqing 400715 Peoples R China;

Southwest Univ Chongqing Key Lab Nonlinear Circuits & Intelligen Chongqing 400715 Peoples R China;

Chongqing Univ Coll Comp Chongqing 400044 Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Privacy preserving data mining; Frequent itemset; Database reconstruction; Inverse frequent itemset mining; Database extension;

机译：隐私保留数据挖掘;频繁的项目集;数据库重建;逆频繁项目集挖掘;数据库扩展;

相似文献

外文文献
中文文献
专利

1. Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction [J] . Li Shaoxin, Mu Nankun, Le Junqing, Computers & Security . 2019,第JULa期

机译：隐私保护频繁项集挖掘：基于数据库重建的数据实用程序最大化
2. Privacy-preserving frequent itemset mining in vertically partitioned database using symmetric homomorphic encryption scheme [J] . Jyoti Lamba, V.C. Venkaiah International journal of information privacy, security and integrity . 2020,第3期

机译：使用对称同性恋加密方案在垂直分区数据库中常用的频繁罚款挖掘
3. Transaction Database Modification Approaches for Achieving Privacy Preserving in High Utility Itemset Mining [J] . Advanced Science Letters . 2016,第9期

机译：在高实用程序项集挖掘中实现隐私保留的事务数据库修改方法
4. Privacy Preserving Frequent Itemsets Mining Based on Database Reconstruction [C] . Shaoxin Li, Nankun Mu, Xiaofeng Liao International Conference on Information Science and Technology . 2018

机译：基于数据库重构的隐私保护频繁项集挖掘
5. New algorithms for frequent sequential pattern and itemset data mining in certain and uncertain databases. [D] . Peterson, Erich Allen. 2012

机译：在某些不确定数据库中频繁进行顺序模式和项集数据挖掘的新算法。
6. Personalized Privacy-Preserving Frequent Itemset Mining Using Randomized Response [O] . Chongjing Sun, Yan Fu, Junlin Zhou, -1

机译：使用随机响应的个性化隐私保护频繁项集挖掘
7. Dummy Data Insert Scheme for Privacy Preserving Frequent Itemset Mining in Data Stream [O] . Jay Yeol Jung, Kee Sung Kim, Ik Rae Jeong 2013

机译：用于隐私保留在数据流中的频繁项目集挖掘的虚拟数据插入方案

Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction

摘要

著录项

相似文献

相关主题

期刊订阅