【24h】

Discovering long maximal frequent pattern

机译:发现长的最大频繁模式

获取原文

摘要

Association rule mining, the most commonly used method for data mining, has numerous applications. Although many approaches that can find association rules have been developed, most utilize maximum frequent itemsets that are short. Existing methods fail to perform well in applications involving large amounts of data and incur longer itemsets. Apriori-like algorithms have this problem because they generate many candidate itemsets and spend considerable time scanning databases; that is, their processing method is bottom-up and layered. This paper solves this problem via a novel hybrid Multilevel-Search algorithm. The algorithm concurrently uses the bidirectional Pincer-Search and parameter prediction mechanism along with the bottom-up search of the Parameterised method to reduce the number of candidate itemsets and consequently, the number of database scans. Experimental results demonstrate that the proposed algorithm performs well, especially when the length of the maximum frequent itemsets are longer than or equal to eight. The concurrent approach of our multilevel algorithm results in faster execution time and improved efficiency.
机译:关联规则挖掘是最常用的数据挖掘方法,具有许多应用程序。尽管已经开发了许多可以找到关联规则的方法,但是大多数方法都利用了最短的最大频繁项集。现有方法无法在涉及大量数据的应用程序中很好地执行,并且会导致项目集更长。类似Apriori的算法存在此问题,因为它们生成许多候选项目集并花费大量时间扫描数据库。也就是说,它们的处理方法是自下而上和分层的。本文通过一种新颖的混合多级搜索算法解决了这一问题。该算法同时使用双向Pincer搜索和参数预测机制以及Parameterized方法的自底向上搜索,以减少候选项目集的数量,从而减少数据库扫描的数量。实验结果表明,所提出的算法性能良好,特别是当最大频繁项集的长度大于或等于8时。我们的多级算法的并发方法可以缩短执行时间并提高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号