首页> 外文期刊>Engineering Applications of Artificial Intelligence >A novel algorithm for privacy preserving utility mining based on integer linear programming
【24h】

A novel algorithm for privacy preserving utility mining based on integer linear programming

机译:一种基于整数线性规划的隐私保存公用事业挖掘算法

获取原文
获取原文并翻译 | 示例
           

摘要

As an important research topic, high-utility itemset mining (HUIM) has, of late years, attracted increasing attention, where both the significance and quantity factors of items are taken into account to mine high-utility itemsets (HUIs). Privacy breaches have always been a major issue existing in the field of data mining, which usually inevitably arise, especially when private data collections are publicly published or shared by organizations. To tackle this problem, plentiful methodologies regarding privacy-preserving data mining (PPDM) have been proposed. Due to the high practicality of HUIM, in recent years, privacy-preserving utility mining (PPUM) has become a popular research orientation in PPDM. The main goal of PPUM is to hide sensitive HUIs ( SHUIs) so as to leave no confidential information uncovered in the resulting sanitized database. However, all the previously proposed approaches have suffered from the defect of introducing numerous side effects by performing database perturbation. To alleviate this issue, in this paper, a novel algorithm based on integer linear programming (ILP) is proposed to obtain a lower ratio of side effects produced in the hiding process while does not reveal any sensitive information in the sanitized database. We formulate the hiding process as a constraint satisfaction problem (CSP), which pursuing the protection of SHUIs as well as the minimization of side effects. A solution to the hiding problem is expected to be obtained by exploiting ILP technique to solve the mapped problem, which properly indicates the processing manner of perturbation operation. In addition, a relaxation procedure is also adopted in the designed algorithm to provide an approximate solution of the CSP when the optimal one does not exist. Extensive experimental evaluations between our proposed method and other stateof-the-art algorithms are conducted on several real-world datasets. The comparative results demonstrate the superiorities of the proposed algorithm with respect to running time and the ability to minimize side effects.
机译:作为一个重要的研究课题,高历年的高效项目集(Huim)引起了越来越多的关注,其中物品的意义和数量因素都考虑到矿井高实用项目集(Huis)。隐私违规始终是存在于数据挖掘领域的主要问题,这些挖掘领域通常不可避免地出现,特别是当私人数据收集被公开发布或由组织分享时。为了解决这个问题,已经提出了关于隐私保留数据挖掘(PPDM)的丰富方法。由于惠珥的实用性高,近年来,防私保存的公用事业挖掘(PPUM)已成为PPDM中受欢迎的研究方向。 PPUM的主要目标是隐藏敏感的HUIS(SHUIS),以便在所产生的消毒数据库中没有揭示的机密信息。然而,通过执行数据库扰动,所有先前提出的方法都遭受了引入众多副作用的缺陷。为了缓解本问题,本文提出了一种基于整数线性编程(ILP)的新算法,以获得隐藏过程中产生的副作用的较低比率,同时在消毒数据库中没有揭示任何敏感信息。我们将隐藏过程制定为约束满足问题(CSP),追求舒斯的保护以及副作用的最小化。预期通过利用ILP技术来解决映射问题的覆盖问题来获得躲藏问题的解决方案,该方法适当地指示扰动操作的处理方式。另外,在设计的算法中还采用了弛豫程序,以提供当最佳的算法的近似解。我们所提出的方法和其他国家的广泛的实验评估在几个真实的数据集上进行了最新的算法。比较结果展示了所提出的算法关于运行时间的优越性和最小化副作用的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号