...
首页> 外文期刊>Sadhana: Academy Proceedings in Engineering Science >An efficient approach based on selective partitioning for maximal frequent itemsets mining
【24h】

An efficient approach based on selective partitioning for maximal frequent itemsets mining

机译:基于最大频繁项目集采矿的选择性分区的一种有效方法

获取原文
获取原文并翻译 | 示例
           

摘要

We present a maximal frequent itemset (MFI) mining algorithm based on selective partitioning called SelPMiner. It makes use of a novel data format named Itemset-count tree-a compact and optimized representation in the form of partition that reduces memory requirement. It also does selective partitioning of the database, which reduces runtime to scan database. As the algorithm progressively searches for longer frequent itemsets in a depth-first manner, it creates new partitions with even smaller sizes having less dimensions and unique data instances, which results in faster support counting. SelPMiner uses a number of optimizations to prune the search space. We also prove upper bounds on the amount of memory consumed by these partitions. Experimental comparisons of the SelPMiner algorithm with popular existing fastest MFI mining algorithms on different types of datasets show significant speedup in computation time for many cases. SelPMiner works especially well when the minimum support is low and consumes less memory.
机译:我们提出了一种基于名为SELPMINER的选择性分区的最大频繁的项目集(MFI)挖掘算法。它利用名为itemset-count树的新型数据格式 - 以缩小内存要求的分区形式的Compact且优化表示。它还可以选择性分区数据库,这减少了运行时到扫描数据库。随着算法以深度第一方式逐渐搜索更长频繁的项目集,它创建具有较小尺寸和唯一数据实例的更小的尺寸的新分区,从而导致更快的支持计数。 Selpminer使用许多优化来修剪搜索空间。我们还在这些分区消耗的内存量上证明了上限。在不同类型数据集上具有流行现有最快的MFI挖掘算法的SELPMINER算法的实验比较显示了许多情况下的计算时间中的显着加速。当最小支持低并且消耗更少的内存时,SELPMINER特别好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号