Abstract Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem
首页> 外文期刊>Information Sciences: An International Journal >Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem
【24h】

Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem

机译:组合Apriori启发式和生物启发算法来解决频繁的符合矿物挖掘问题

获取原文
获取原文并翻译 | 示例
           

摘要

Abstract Exact approaches to Frequent Itemsets Mining (FIM) are characterised by poor runtime performance when dealing with large database instances. Several FIM bio-inspired approaches have been proposed to overcome this issue. These are considerably more efficient from the point of view of runtime performance, but they still yield poor quality solutions. The quality of the solution, i.e., the number of frequent itemsets discovered, can be increased by improving the randomised search of the solutions space considering intrinsic features of the FIM problem. This paper proposes a new framework for FIM bio-inspired approaches that considers the recursive property of frequent itemsets, i.e., the same feature exploited by the Apriori exact heuristic, in the search of the solution space. We define two new approaches to FIM, namely GA-Apriori and PSO-Apriori, based on the proposed framework, which use genetic algorithms and particle swarm optimisation, respectively. Extensive experiments on synthetic and real database instances show that the proposed approaches outperform other bio-inspired ones in terms of runtime performance. The results also reveal that the performance of PSO-Apriori is comparable to the one of exact approaches Apriori and FPGrowth in respect of the quality of solutions found. We also show that PSO-Apriori outperforms the recently developed BATFIM algorithm when dealing with very large database instances. ]]>
机译:<![cdata [ 抽象 频繁项目集的精确方法挖掘(FIM)的特点是在处理大型数据库实例时的运行时性能不佳。已经提出了几种FIM生物启发方法来克服这个问题。这些从运行时性能的角度来看,它们仍然具有较差的质量解决方案。通过改进CIM问题的内在特征,可以通过改进解决方案空间的随机搜索来增加解决方案的质量,即发现的频繁项目集的数量。本文提出了一种新的生物启发方法的新框架,该方法考虑了频繁项目集的递归特性,即Apriori精确启发式的相同功能,在寻找解决方案空间中。基于所提出的框架,我们将两种新方法定义了FIM,即GA-APRIORI和PSO-APRIORI,分别使用遗传算法和粒子群优化。对综合和真实数据库实例的广泛实验表明,在运行时性能方面,该提议的方法优于其他生物启发。结果还表明,PSO-APRiori的性能与APRiori和FPGROW在发现的溶液质量方面的性能相当。我们还表明,PSO-APRIORI在处理非常大的数据库实例时,最近开发的BATFIM算法优于最近开发的BATFIM算法。 ]]>

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号