...
首页> 外文期刊>Engineering Applications of Artificial Intelligence >A novel algorithm for privacy preserving utility mining based on integer linear programming
【24h】

A novel algorithm for privacy preserving utility mining based on integer linear programming

机译:基于整数线性规划的隐私保护实用程序挖掘新算法

获取原文
获取原文并翻译 | 示例
           

摘要

As an important research topic, high-utility itemset mining (HUIM) has, of late years, attracted increasing attention, where both the significance and quantity factors of items are taken into account to mine high-utility itemsets (HUIs). Privacy breaches have always been a major issue existing in the field of data mining, which usually inevitably arise, especially when private data collections are publicly published or shared by organizations. To tackle this problem, plentiful methodologies regarding privacy-preserving data mining (PPDM) have been proposed. Due to the high practicality of HUIM, in recent years, privacy-preserving utility mining (PPUM) has become a popular research orientation in PPDM. The main goal of PPUM is to hide sensitive HUIs ( SHUIs) so as to leave no confidential information uncovered in the resulting sanitized database. However, all the previously proposed approaches have suffered from the defect of introducing numerous side effects by performing database perturbation. To alleviate this issue, in this paper, a novel algorithm based on integer linear programming (ILP) is proposed to obtain a lower ratio of side effects produced in the hiding process while does not reveal any sensitive information in the sanitized database. We formulate the hiding process as a constraint satisfaction problem (CSP), which pursuing the protection of SHUIs as well as the minimization of side effects. A solution to the hiding problem is expected to be obtained by exploiting ILP technique to solve the mapped problem, which properly indicates the processing manner of perturbation operation. In addition, a relaxation procedure is also adopted in the designed algorithm to provide an approximate solution of the CSP when the optimal one does not exist. Extensive experimental evaluations between our proposed method and other stateof-the-art algorithms are conducted on several real-world datasets. The comparative results demonstrate the superiorities of the proposed algorithm with respect to running time and the ability to minimize side effects.
机译:近年来,作为一项重要的研究主题,高实用性项目集挖掘(HUIM)引起了越来越多的关注,其中考虑了项目的重要性和数量因素来挖掘高实用性项目集(HUI)。隐私泄露一直是数据挖掘领域中存在的主要问题,通常不可避免地出现,尤其是在组织公开发布或共享私有数据集合时。为了解决这个问题,已经提出了很多关于隐私保护数据挖掘(PPDM)的方法。由于HUIM的高度实用性,近年来,隐私保护实用程序挖掘(PPUM)已成为PPDM中流行的研究方向。 PPUM的主要目标是隐藏敏感的HUI(SHUI),以便在生成的经过清理的数据库中不保留任何机密信息。但是,所有先前提出的方法都存在通过执行数据库扰动而引入大量副作用的缺陷。为了缓解这个问题,在本文中,提出了一种基于整数线性规划(ILP)的新算法,以降低隐藏过程中产生的副作用的比率,而不会在经过清理的数据库中显示任何敏感信息。我们将隐藏过程公式化为约束满足问题(CSP),该问题追求SHUI的保护以及副作用的最小化。期望通过利用ILP技术解决映射问题来解决隐藏问题,可以正确地指出扰动运算的处理方式。另外,在不存在最优解的情况下,在所设计的算法中还采用了松弛过程来提供CSP的近似解。我们在几种实际数据集上对我们提出的方法与其他最新算法进行了广泛的实验评估。比较结果证明了所提出算法在运行时间和最小化副作用方面的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号