基于Hadoop二阶段并行模糊c-Means聚类算法

胡吉朝; 黄红艳

首页> 中文期刊> 《计算机应用与软件》 >基于Hadoop二阶段并行模糊c-Means聚类算法

基于Hadoop二阶段并行模糊c-Means聚类算法

AI论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Aiming at the problem of too high occupancy of communication time and limited applying value of the algorithm under the mechanism of Mapreduce,we put forward a Hadoop-based two-stage parallel c-Means clustering algorithm to deal with the problem of extra-large data classification.First,we improved the MPI communication management method in Mapreduce mechanism,and used membership management protocol mode to realise the synchronisation of members management and Mapreduce reducing operation.Secondly, we implemented typical individuals group reducing operation instead of global individual reducing operation,and defined the two-stage buffer algorithm.Finally,through the buffer in first stage we further reduced the data amount of Mapreduce operation in second stage,and reduced the negative impact brought about by big data on the algorithm as much as possible.Based on this,we carried out the simulation by using artificial big data test set and KDD CUP 99 invasion test data.Experimental result showed that the algorithm could both guarantee the clustering precision requirement and speed up effectively the operation efficiency of algorithm.%针对Mapreduce机制下算法通信时间占用比过高，实际应用价值受限的情况，提出基于Hadoop二阶段并行c-Means聚类算法用来解决超大数据的分类问题。首先，改进Mapreduce机制下的MPI通信管理方法，采用成员管理协议方式实现成员管理与Mapreduce降低操作的同步化；其次，实行典型个体组降低操作代替全局个体降低操作，并定义二阶段缓冲算法；最后，通过第一阶段的缓冲进一步降低第二阶段Mapreduce操作的数据量，尽可能降低大数据带来的对算法负面影响。在此基础上，利用人造大数据测试集和KDD CUP 99入侵测试集进行仿真，实验结果表明，该算法既能保证聚类精度要求又可有效加快算法运行效率。

著录项

来源
《计算机应用与软件》 |2016年第6期|282-286|共5页
作者
胡吉朝; 黄红艳;
展开▼
作者单位

石家庄经济学院信息工程学院河北石家庄050031;

石家庄经济学院信息工程学院河北石家庄050031;

展开▼
原文格式 PDF
正文语种 chi
中图分类程序语言、算法语言;
关键词
二阶段; 模糊c-Means; 大数据; 聚类; 并行; 入侵检测;

相似文献

中文文献
外文文献
专利

1. 基于Hadoop二阶段并行模糊c-Means数据聚类算法 [J] . 高献卫 ,师智斌 . 计算机测量与控制 . 2015,第003期
2. 基于模糊c-means与自适应粒子群优化的模糊聚类算法 [J] . 耿宗科 ,王长宾 ,张振国 . 计算机科学 . 2016,第008期
3. 基于粗糙集理论的模糊C-means高维数据聚类算法 [J] . 朱付保 ,徐显景 ,白庆春 . 华中师范大学学报（自然科学版） . 2015,第004期
4. 基于模糊C-means的多视角聚类算法 [J] . 杨欣欣 ,黄少滨 . 中南大学学报（自然科学版） . 2015,第006期
5. 基于Hadoop平台的K-means聚类算法并行化改进研究 [J] . 禤世丽 ,刘建明 . 玉林师范学院学报 . 2020,第3期
6. 基于Hadoop的k-means聚类算法并行实现 [C] . 顾嘉伟 ,尚俊娜 . 浙江省信号处理学会2015学术年会 . 2015
7. 基于Hadoop的并行小波聚类算法 [A] . 欧炳华 . 2015

基于Hadoop二阶段并行模糊c-Means聚类算法

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅