首页> 外文会议>IASTED International Conference on Computational Intelligence >APPLYING BIT-VECTOR PROJECTION APPROACH FOR EFFICIENT MINING OF N-MOST INTERESTING FREQUENT ITEMSETS
【24h】

APPLYING BIT-VECTOR PROJECTION APPROACH FOR EFFICIENT MINING OF N-MOST INTERESTING FREQUENT ITEMSETS

机译:应用比特矢量投影方法,以实现N最有趣的频繁项目集的有效挖掘

获取原文

摘要

Real world datasets are sparse, dirty and contain hundreds of items In such situations, discovering interesting rules (results) using traditional frequent itemset mining approach by specifying a user defined input support threshold is not appropriate. Since without any domain knowledge, setting support threshold small or large can output nothing or a large number of redundant uninteresting results. Recently a novel approach of mining N-most interesting itemsets is proposed, which discovers only top N interesting results without specifying any user defined support threshold. However, mining N-most interesting itemsets are more costly in terms of itemset search space exploration and processing cost Thereby, the efficiency of mining process highly depends upon the itemset frequency (support) counting, implementation techniques and projection of relevant transactions to lower level nodes of search space. In this paper, we present a novel N-most interesting itemset mining algorithm (N-MostMiner) using the bit-vector representation approach which is very efficient in terms of itemset frequency counting and transactions projection Several efficient implementation techniques of N-MostMiner are also present which we experienced in our implementation Our different experimental results on benchmark datasets suggest that the N-MostMiner is very efficient in terms of processing time as compared to currently best algorithm BOMO.
机译:真实世界数据集是稀疏,脏污,并包含在这种情况下的数百个项目,发现使用传统的频繁项目集挖掘方法通过指定用户定义的输入支持阈值不合适,发现有趣的规则(结果)是不合适的。由于没有任何域知识,因此设置支持阈值小或大可以输出任何东西或大量冗余的无趣的结果。最近提出了一种新颖的挖掘n最有趣的项目集的方法,该方法仅在不指定任何用户定义的支持阈值的情况下发现顶部n有趣的结果。然而,在项目集搜索空间探索和处理成本方面,挖掘n最有趣的项目集更昂贵,因此,采矿过程的效率高度取决于项目集频率(支持)计数,实现技术和对较低级别节点的相关事务的投影搜索空间。在本文中,我们介绍了一种新颖的n最有趣的项目集挖掘算法(n-mostminer),使用比特 - 向量表示方法,即在项目集频率计数和事务投影方面非常有效,但是N-mostminer的几个有效的实现技术也是如此目前我们在我们的实施中经验丰富的基准数据集的不同实验结果表明,与当前最好的算法Bomo相比,N-MOSTMINER在处理时间方面非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号