首页> 外文期刊>Journal of Parallel and Distributed Computing >Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets
【24h】

Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets

机译:并行WaveCluster:一种线性缩放并行聚类算法实现,适用于非常大的数据集

获取原文
获取原文并翻译 | 示例

摘要

A linear scaling parallel clustering algorithm implementation and its application to very large datasets for cluster analysis is reported. WaveCluster is a novel clustering approach based on wavelet transforms. Despite this approach has an ability to detect clusters of arbitrary shapes in an efficient way, it requires considerable amount of time to collect results for large sizes of multi-dimensional datasets. We propose the parallel implementation of the WaveCluster algorithm based on the message passing model for a distributed-memory multiprocessor system. In the proposed method, communication among processors and memory requirements are kept at minimum to achieve high efficiency.- We have conducted the experiments on a dense dataset and a sparse dataset to measure the algorithm behavior appropriately. Our results obtained from performed experiments demonstrate that developed parallel WaveCluster algorithm exposes high speedup and scales linearly with the increasing number of processors.
机译:报告了线性缩放并行聚类算法的实现及其在用于聚类分析的超大型数据集上的应用。 WaveCluster是一种基于小波变换的新颖聚类方法。尽管此方法具有以有效方式检测任意形状的聚类的能力,但仍需要大量时间来收集大型多维数据集的结果。我们建议基于消息传递模型的分布式内存多处理器系统的WaveCluster算法的并行实现。在所提出的方法中,保持处理器之间的通信和内存需求最小化以实现高效率。-我们在密集数据集和稀疏数据集上进行了实验,以适当地测量算法行为。我们从执行的实验中获得的结果表明,开发的并行WaveCluster算法可实现高加速,并随着处理器数量的增加而线性扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号