Hadoop环境下基于并行熵的FIUT算法挖掘

晏依; 徐苏

首页> 中文期刊> 《计算机工程与设计》 >Hadoop环境下基于并行熵的FIUT算法挖掘

Hadoop环境下基于并行熵的FIUT算法挖掘

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Focusing on the inefficient problem of traditional algorithms for mining frequent itemsets, aparallel algorithm named Balanced_MapReduce_FIUT (BMR-FIUT) based on Hadoop platform was proposed.By introducing frequent items ultrametric tree (FIU-Tree) structure, frequent itemsets were mined, effectively avoiding the defects of the traditional algorithm.The process of decomposition was improved with FIUT algorithm to adapt to its parallel computing under the framework of MapReduce, achieving the purpose of parallelization.The parallel entropy was used as the load balance measurement in cluster system, so that system could in all reasonable to distribute data as much as possible between every nodes.Experimental results show that BMR-FIUT algorithm can effectively reduce the problem about load inclination of any node in the process of parallelization, it is superior to the existing PFP-Growth algorithm and it has better performance on mining volume big data.%针对传统频繁项集挖掘算法效率低下的问题, 提出基于Hadoop平台的并行BMR-FIUT算法.通过引入FIU-Tree (frequent items ultrametric tree) 结构挖掘频繁项集, 避免传统算法的缺陷;改进FIUT算法的分解过程, 使之适应于Map-Reduce框架下的并行计算, 达到并行化的目的;利用并行熵作为集群系统的负载均衡度量, 使系统尽可能在各节点间合理分发数据以平衡负载.实验结果表明, BMR-FIUT算法能够有效减少并行化过程中节点负载倾斜的问题, 较现有的PFP-Growth算法具有更好的性能, 适用于海量数据挖掘.

著录项

来源
《计算机工程与设计》 |2019年第3期|685-690787|共7页
作者
晏依; 徐苏;
展开▼
作者单位

南昌大学信息工程学院;

江西南昌 330031;

南昌大学信息工程学院;

江西南昌 330031;

展开▼
原文格式 PDF
正文语种 chi
中图分类程序设计、软件工程;
关键词
数据挖掘; 频繁项集; MapReduce编程模型; FIUT算法; 并行熵; 负载均衡;

相似文献

中文文献
外文文献
专利

1. 大数据环境下基于Hadoop框架的改进Apriori挖掘算法 [J] . 曾毅 ,周湘贞 . 机床与液压 . 2019,第006期
2. 大数据环境下基于Hadoop平台的医学数据挖掘算法研究 [J] . Guang-kun JIANG . 机床与液压 . 2018,第018期
3. 大数据环境下基于Hadoop框架的数据挖掘算法的研究与实现 [J] . 洪波 ,吕燕霞 ,黄磊 . 电子设计工程 . 2017,第007期
4. 大数据环境下基于Hadoop框架的数据挖掘算法的研究与实现 [J] . 周佩 ,朱娴睿 ,何汉 . 电脑知识与技术 . 2017,第029期
5. 基于信息熵与遗传算法的并行关联规则增量挖掘算法 [J] . 毛伊敏 ,邓千虎 ,陈志刚 . 通信学报 . 2021,第005期
6. 云环境下基于二进制编码聚类的并行频繁项集挖掘算法 [C] . LIU Bo ,刘博 ,LI Yun . 2012年江苏省人工智能学术会议 . 2012
7. 基于Hadoop平台并行关联规则挖掘算法研究 [A] . 张玲 . 2017

Hadoop环境下基于并行熵的FIUT算法挖掘

摘要

著录项

相似文献

相关主题

期刊订阅