首页> 外文会议>International Conference on Fuzzy Systems and Knowledge Discovery >An improved parallel association rules algorithm based on MapReduce framework for big data
【24h】

An improved parallel association rules algorithm based on MapReduce framework for big data

机译:基于MapReduce框架的大数据并行关联规则改进算法

获取原文

摘要

Association rules mining is one of the most popular and significant issue in data mining and intends to discovery interest relations between variables in database. In our paper, we implemented an improved parallel Apriori algorithm which realized both count and candidate generation steps under MapReduce framework, while existing parallel Apriori algorithm only considered count step. We analyzed the time complexity of our improved parallel algorithm and compared to the original parallel algorithm, which indicates advantages of our algorithm with massive candidate item sets. Based on our experiment result, we proved that our algorithm performs better under big data situation and achieves excellent speedup feature.
机译:关联规则挖掘是数据挖掘中最流行,最重要的问题之一,它旨在发现数据库变量之间的兴趣关系。在本文中,我们实现了一种改进的并行Apriori算法,该算法在MapReduce框架下实现了计数和候选生成步骤,而现有的并行Apriori算法仅考虑了计数步骤。我们分析了改进后的并行算法的时间复杂度,并与原始并行算法进行了比较,这表明我们的算法具有大量候选项目集的优势。根据我们的实验结果,我们证明了该算法在大数据情况下性能更好,并具有出色的加速功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号