首页> 外文期刊>Turkish Journal of Electrical Engineering and Computer Sciences >Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures
【24h】

Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures

机译:基于FP-tree和DIFFset数据结构的智能频繁项集挖掘算法

获取原文
           

摘要

Association rule data mining is an important technique for finding important relationships in large datasets. Several frequent itemsets mining techniques have been proposed using a prefix-tree structure, FP-tree, a compressed data structure for database representation. The DIFFset data structure has also been shown to significantly reduce the run time and memory utilization of some data mining algorithms. Experimental results have demonstrated the efficiency of the two data structures in frequent itemsets mining. This work proposes FDM, a new algorithm based on FP-tree and DIFFset data structures for efficiently discovering frequent patterns in data. FDM can adapt its characteristics to efficiently mine long and short patterns from both dense and sparse datasets. Several optimization techniques are also outlined to increase the efficiency of FDM. An evaluation of FDM against three frequent itemset data mining algorithms, dEclat, FP-growth, and FDM* (FDM without optimization), was performed using datasets having both long and short frequent patterns. The experimental results show significant improvement in performance compared to the FP-growth, dEclat, and FDM* algorithms.
机译:关联规则数据挖掘是一种在大型数据集中查找重要关系的重要技术。已经提出了使用前缀树结构,FP树(一种用于数据库表示的压缩数据结构)的几种频繁项集挖掘技术。还显示了DIFFset数据结构可显着减少某些数据挖掘算法的运行时间和内存利用率。实验结果证明了这两种数据结构在频繁项集挖掘中的效率。这项工作提出了FDM,这是一种基于FP-tree和DIFFset数据结构的新算法,可以有效地发现数据中的频繁模式。 FDM可以适应其特征,以从密集和稀疏数据集中有效地挖掘长短模式。还概述了几种优化技术,以提高FDM的效率。使用具有长期和短期频繁模式的数据集,针对dEclat,FP-growth和FDM *(未经优化的FDM)这三种频繁项数据挖掘算法对FDM进行了评估。实验结果表明,与FP-growth,dEclat和FDM *算法相比,性能有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号