...
首页> 外文期刊>Knowledge and information systems >A pattern decomposition algorithm for data mining of frequent patterns
【24h】

A pattern decomposition algorithm for data mining of frequent patterns

机译:一种频繁模式数据挖掘的模式分解算法

获取原文
获取原文并翻译 | 示例
           

摘要

Efficient algorithms to mine frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed in 1994, there have been several methods proposed to improve its performance. However, most still adopt its candidate set generation-and-test approach. In addition, many methods do not generate all frequent patterns, making them inadequate to derive association rules. We propose a pattern decomposition (PD) algorithm that can significantly reduce the size of the dataset on each pass, making it more efficient to mine all frequent patterns in a large dataset. The proposed algorithm avoids the costly process of candidate set generation and saves time by reducing the size of the dataset. Our empirical evaluation shows that the algorithm outperforms Apriori by one order of magnitude and is faster than FP-tree algorithm.
机译:高效的频繁模式挖掘算法对于数据挖掘中的许多任务至关重要。自1994年提出Apriori算法以来,已经提出了几种改善其性能的方法。但是,大多数仍然采用其候选集生成和测试方法。另外,许多方法不会生成所有频繁的模式,从而使其不足以得出关联规则。我们提出了一种模式分解(PD)算法,该算法可以显着减小每次通过时数据集的大小,从而更有效地挖掘大型数据集中的所有频繁模式。所提出的算法避免了候选集生成的昂贵过程,并通过减小数据集的大小节省了时间。我们的经验评估表明,该算法的性能比Apriori高一个数量级,并且比FP-tree算法快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号