首页> 外文会议>Computational Intelligence and Data Mining, 2009. CIDM '09 >Maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams
【24h】

Maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams

机译:仅维护频繁项目集以通过在线数据流挖掘近似的频繁项目集

获取原文
获取外文期刊封面目录资料

摘要

Mining frequent itemsets over online data streams, where the new data arrive and the old data will be removed with high speed, is a challenge for the computational complexity. Existing approximate mining algorithms suffer from explosive computational complexity when decreasing the error parameter, isin, which is used to control the mining accuracy. We propose a new approximate mining algorithm using an approximate frequent itemset tree (abbreviated as AFI-tree), called AFI algorithm, to mine approximate frequent itemsets over online data streams. The AFI-tree based on prefix tree maintains only frequent itemsets, so the number of nodes in the tree is very small. All the infrequent child nodes of any frequent node are pruned and the maximal support of the pruned nodes is estimated to detect new frequent itemsets. In order to guarantee the mining accuracy, when the estimated maximal support of the pruned nodes is a bit more than the minimum support, their supports will be re-computed and the frequent nodes among them will be inserted into the AFI-tree. Experimental results show that the AFI algorithm consumes much less memory space than existing algorithms, and runs much faster than existing algorithms in most occasions.
机译:在在线数据流上挖掘频繁的项目集(新数据到达那里,旧数据将被高速删除)对于计算复杂性是一个挑战。现有的近似挖掘算法在降低误差参数isin时会遭受爆炸性的计算复杂性,该误差参数用于控制挖掘精度。我们提出一种使用近似频繁项集树(简称为AFI-tree)的新近似挖掘算法,称为AFI算法,以通过在线数据流挖掘近似频繁项集。基于前缀树的AFI树仅维护频繁的项目集,因此树中的节点数非常少。修剪任何频繁节点的所有不频繁子节点,并估计所修剪节点的最大支持以检测新的频繁项目集。为了保证挖掘的准确性,当修剪后的节点的估计最大支持量大于最小支持量时,将重新计算它们的支持量,并将其中的频繁节点插入AFI树中。实验结果表明,AFI算法消耗的内存空间比现有算法少得多,并且在大多数情况下运行速度都比现有算法快得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号