An improved parallel association rules algorithm based on MapReduce framework for big data

机译：基于MapReduce框架的大数据并行关联规则改进算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Association rules mining is one of the most popular and significant issue in data mining and intends to discovery interest relations between variables in database. In our paper, we implemented an improved parallel Apriori algorithm which realized both count and candidate generation steps under MapReduce framework, while existing parallel Apriori algorithm only considered count step. We analyzed the time complexity of our improved parallel algorithm and compared to the original parallel algorithm, which indicates advantages of our algorithm with massive candidate item sets. Based on our experiment result, we proved that our algorithm performs better under big data situation and achieves excellent speedup feature.

机译：关联规则挖掘是数据挖掘中最流行，最重要的问题之一，它旨在发现数据库变量之间的兴趣关系。在本文中，我们实现了一种改进的并行Apriori算法，该算法在MapReduce框架下实现了计数和候选生成步骤，而现有的并行Apriori算法仅考虑了计数步骤。我们分析了改进后的并行算法的时间复杂度，并与原始并行算法进行了比较，这表明我们的算法具有大量候选项目集的优势。根据我们的实验结果，我们证明了该算法在大数据情况下性能更好，并具有出色的加速功能。

著录项

来源
《International Conference on Fuzzy Systems and Knowledge Discovery》|2014年|284-288|共5页
会议地点
作者
Zhou Xinhao; Huang Yongfeng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Algorithm design and analysis; Association rules; Big data; Clustering algorithms; Databases; Time complexity; Apriori; Association Rules; Data Mining; Hadoop; MapReduce;

机译：算法设计与分析;关联规则;大数据;聚类算法;数据库;时间复杂度; Apriori;关联规则;数据挖掘; Hadoop; MapReduce;

相似文献

外文文献
中文文献
专利

1. High Performance Computation of Big Data: Performance Optimization Approach towards a Parallel Frequent Item Set Mining Algorithm for Transaction Data based on Hadoop MapReduce Framework [J] . Guru Prasad M S, Nagesh H R, Swathi Prabhu International Journal of Intelligent Systems and Applications . 2017,第1期

机译：大数据的高性能计算：基于Hadoop MapReduce框架的事务数据并行频繁项集挖掘算法的性能优化方法
2. DCE -miner: an association rule mining algorithm for multimedia based on the MapReduce framework [J] . LI Chengyan, Shixiang FENG, Guanglu SUN Multimedia Tools and Applications . 2020,第23a24期

机译：DCE -Miner：基于MapReduce框架的多媒体关联规则挖掘算法
3. A MapReduce-Based Parallel Frequent Pattern Growth Algorithm for Spatiotemporal Association Analysis of Mobile Trajectory Big Data [J] . Xia Dawen, Lu Xiaonan, Li Huaqing, Complexity . 2018,第1期

机译：基于MapReduce的并行频繁模式增长算法用于移动轨迹大数据的时空关联分析
4. An improved parallel association rules algorithm based on MapReduce framework for big data [C] . Zhou Xinhao, Huang Yongfeng International Conference on Fuzzy Systems and Knowledge Discovery . 2014

机译：基于MapReduce框架的改进的并联关联规则算法
5. Efficient sequential and parallel algorithms for mining association rules in text databases [D] . Holt, John D. 2003

机译：用于挖掘文本数据库中关联规则的高效顺序和并行算法
6. K-mer clustering algorithm using a MapReduce framework: application to the parallelization of the Inchworm module of Trinity [O] . Chang Sik Kim, Martyn D. Winn, Vipin Sachdeva, 2017

机译：使用MapReduce框架的K-mer聚类算法：在Trinity的Inchworm模块并行化中的应用
7. A paralleled big data algorithm with mapreduce framework for mining twitter data [O] . Bing L, Chan KCC 2015

机译：带有mapreduce框架的并行大数据算法，用于挖掘Twitter数据

An improved parallel association rules algorithm based on MapReduce framework for big data

摘要

著录项

相似文献

相关主题

期刊订阅