首页> 外文期刊>Journal of Computers >A Parallel Clustering Algorithm with MPI – MKmeans
【24h】

A Parallel Clustering Algorithm with MPI – MKmeans

机译:具有MPI - Mkmeans的并行聚类算法

获取原文
获取外文期刊封面目录资料

摘要

— Clustering is one of the most popular methods for exploratory data analysis, which is prevalent in many disciplines such as image segmentation, bioinformatics, pattern recognition and statistics etc. The most famous clustering algorithm is K-means because of its easy implementation, simplicity, efficiency and empirical success. However, the real-world applications produce huge volumes of data, thus, how to efficiently handle of these data in an important mining task has been a challenging and significant issue. In addition, MPI (Message Passing Interface) as a programming model of message passing presents high performances, scalability and portability. Motivated by this, a parallel K-means clustering algorithm with MPI, called MKmeans, is proposed in this paper. The algorithm enables applying the clustering algorithm effectively in the parallel environment. Experimental study demonstrates that MKmeans is relatively stable and portable, and it performs with low overhead of time on large volumes of data sets.
机译:- 群集是探索数据分析最流行的方法之一,这在许多学科中普遍存在,如图像分割,生物信息学,模式识别和统计数据等。最着名的聚类算法是K-Meanse,因为它易于实现,简单,效率和经验成功。然而,现实世界应用程序产生巨大的数据,因此,如何在重要的采矿任务中有效地处理这些数据一直是一个具有挑战性和重要的问题。此外,MPI(消息传递接口)作为消息传递的编程模型具有高性能,可伸缩性和可移植性。由此,在本文中提出了一种并行K-Means聚类算法,其中包含MPI,称为Mkmeans。该算法使得能够在并行环境中有效地应用聚类算法。实验研究表明,MKMeans是相对稳定和便携式的,并且它在大量数据集上执行了低的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号