首页> 中文期刊>计算机应用研究 >基于MapRed uce的 FCM聚类集成算法

基于MapRed uce的 FCM聚类集成算法

     

摘要

模糊C-均值(FCM)聚类集成算法是一种利用集成思想提高聚类质量的方法。针对FCM聚类集成算法随着数据量的增加时间复杂度过高的问题,提出一种基于MapReduce框架的并行FCM聚类集成算法。首先利用多组随机的初始聚类中心分别进行聚类来获取具有差异化的聚类成员;然后通过建立聚类成员簇间OVER-LAP矩阵来寻找逻辑等价簇,统一所有聚类成员中的簇标记;最后利用投票法共享聚类成员中数据对象的聚类情况得出最终的聚类结果。实验结果表明,该算法具有良好的精确度、加速比和扩展性,具有处理较大规模数据集的能力。%Fuzzy C-means(FCM)clustering ensemble algorithm is a kind of method to improve the quality of clustering by using the integrated thought,but with the increased amount of data the time complexity will be increased.This paper proposed a parallel FCM clustering ensemble algorithm based on MapReduce.This algorithm used random initial cluster centers to get differential clustering members.It unified the cluster labels by establishing an OVERLAP matrix among the clusters to find the logical equivalence clusters.The final clustering results were obtained by voting that shared the information of the classification of the data object in clustering members.The experimental results show that the parallel FCM clustering ensemble algorithm has good precision and the advantages of both high speedup and good scalability.As a result,this algorithm has the ability of dealing with large-scale data set.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号