首页> 外文会议>2010 2nd International Conference on Future Computer and Communication >Mining maximal patterns based on improved FP-tree and array technique
【24h】

Mining maximal patterns based on improved FP-tree and array technique

机译:基于改进的FP-tree和数组技术挖掘最大模式

获取原文

摘要

Mining frequent itemsets is very important for mining association rules. However, because of the inherent complexity, mining complete frequent patterns from a dense database could be impractical, and the quantity of the mined patterns is usually very large. It is hard to understand and make use of them. Maximal frequent patterns contain and compress all frequent patterns, and the memory needed for saving them is much smaller than that needed for saving complete patterns. Thus it is greatly valuable to mine maximal frequent patterns. In this paper, the structure of a traditional FP-tree is improved and an efficient algorithm for mining maximal frequent patterns based on improved FP-tree and array technique, called IAFP-max, is presented. By introducing the concept of postfix sub-tree, the presented algorithm needn''t generate the candidate of maximal frequent patterns in mining process and therefore greatly reduces the memory consume, and it also uses an array-based technique to reduce the traverse time to the improved FP-tree. The experimental evaluation shows that this algorithm outperforms most exiting algorithms MAFIA, GenMax and FPmax.
机译:挖掘频繁项集对于挖掘关联规则非常重要。但是,由于固有的复杂性,从密集的数据库中挖掘完整的频繁模式可能是不切实际的,并且挖掘的模式的数量通常非常大。很难理解和利用它们。最大的频繁模式包含并压缩所有频繁模式,并且保存它们所需的内存比保存完整模式所需的内存小得多。因此,挖掘最大的频繁模式非常有价值。本文对传统FP树的结构进行了改进,提出了一种基于改进FP树和数组技术的最大频繁模式挖掘算法IAFP-max。通过引入后缀子树的概念,提出的算法不需要在挖掘过程中生成最大频繁模式的候选者,因此大大减少了内存消耗,并且还使用了基于数组的技术来减少遍历时间。改进的FP树。实验评估表明,该算法优于大多数现有算法MAFIA,GenMax和FPmax

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号