首页> 外文期刊>Cluster computing >A novel Bit Vector Product algorithm for mining frequent itemsets from large datasets using MapReduce framework
【24h】

A novel Bit Vector Product algorithm for mining frequent itemsets from large datasets using MapReduce framework

机译:使用MapReduce框架从大型数据集中挖掘频繁项目集的新型比特矢量产品算法

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent itemset mining (FIM) is an interesting sub-area of research in the field of Data Mining. With the increase in the size of datasets, conventional FIM algorithms are not suitable and efforts are made to migrate to the Big Data Frameworks for designing algorithms using MapReduce like computing paradigms.We too interested in designing MapReduce based algorithm. Initially, our Parallel Compression algorithm makes data simpler to handle. A novel bit vector data structure is proposed to maintain compressed transactions and it is formed by scanning the dataset only once. Our Bit Vector Product algorithm follows the MapReduce approach and effectively searches for frequent itemsets from a given list of transactions. The experimental results are present to prove the efficacy of our approach over some of the recent works.
机译:频繁的项目集挖掘(FIM)是数据挖掘领域的一个有趣的子区域。 随着数据集的大小的增加,传统的FIM算法不适合,并使努力迁移到使用MapReduce等计算范例设计的算法的大数据框架。我们太感兴趣地设计了基于MapReduce的算法。 最初,我们的并行压缩算法使数据更简单地处理。 提出了一种新颖的比特矢量数据结构以维持压缩事务,并且通过仅扫描一次数据集来形成。 我们的位矢量产品算法遵循MapReduce方法,从给定的事务列表中有效地搜索频繁的项目集。 存在实验结果,以证明我们对最近一些作品的方法的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号