首页> 外文会议>2011 9th International Conference on High Performance Computing Simulation >P-means, a parallel clustering algorithm for a heterogeneous multi-processor environment
【24h】

P-means, a parallel clustering algorithm for a heterogeneous multi-processor environment

机译:P-means,一种用于异构多​​处理器环境的并行集群算法

获取原文

摘要

G-means is a data mining clustering algorithm based on k-means, used to find the number of Gaussian distributions and their centers inside a multi-dimensional dataset. This paper presents the performance gain obtained from the development of a parallel G-means algorithm for a heterogeneous multi-processor environment using the StarSs framework, called here P-means. The P-means execution was divided into 6 well-defined steps, where each step was analyzed to create a hierarchical task structure in order to parallelize the execution enabling it to explore the hierarchy and heterogeneity of the Cell BE blades and others heterogeneous architectures. The algorithm implementation was also adapted to perform sequential timing measures to evaluate the Amdahl's law, to compare the theoretical calculation and the execution times' measurements and to introduce parallel computation by using the StarSs framework. The algorithm was executed using a 30 clusters dataset containing 600 thousand points of 60 dimensions in different hardware configurations in order to compare its execution time and speedup, and it showed a overall speedup of more than 18 times. A successful experimentation with real data demonstrated the usefulness of the algorithm.
机译:G均值是一种基于k均值的数据挖掘聚类算法,用于在多维数据集中查找高斯分布的数量及其中心。本文介绍了通过使用StarSs框架(在此称为P-means)为异构多处理器环境开发并行G-means算法获得的性能提升。 P均值执行分为6个明确定义的步骤,其中每个步骤都经过分析以创建分层的任务结构,以使执行并行化,从而使其能够探索Cell BE刀片和其他异构架构的层次结构和异构性。该算法的实现还适用于执行顺序计时措施,以评估阿姆达尔定律,比较理论计算和执行时间的测量结果,并通过使用StarSs框架引入并行计算。该算法使用30个集群数据集执行,该数据集包含60万个点,在不同的硬件配置下具有60个维度,以便比较其执行时间和加速,结果显示总体加速超过18倍。对真实数据的成功实验证明了该算法的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号