首页> 美国卫生研究院文献>Computational Intelligence and Neuroscience >On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms
【2h】

On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms

机译:基于GPGPU的增量聚类算法的准确性和并行性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering raise high demand on computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, the incremental clustering algorithm is facing a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering like evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions.
机译:增量聚类算法在各种应用程序中扮演着至关重要的角色,例如海量数据分析和实时数据处理。增量集群的典型应用场景对硬件平台的计算能力提出了很高的要求。并行计算是满足此需求的通用解决方案。此外,通用图形处理单元(GPGPU)是有前途的并行计算设备。然而,当由GPGPU提供支持时,增量聚类算法将面临聚类精度和并行性之间的难题。我们正式分析了造成这种困境的原因。首先,我们将与渐进式聚类相关的概念形式化,例如进化粒度。其次,我们正式证明了两个定理。第一个定理证明了聚类精度和演化粒度之间的关系。此外,该定理分析了不同到相同的误隶属关系的上限和下限。发生这种隶属关系的次数越少,意味着精度越高。第二个定理揭示了并行性和不断发展的粒度之间的关系。较小的工作深度意味着更高的并行度。通过证明,我们得出结论,增量聚类算法的准确性与演进的粒度负相关,而并行性与粒度正相关。因此,矛盾的关系造成了两难境地。最后,我们通过演示算法验证了这种关系。实验结果验证了理论结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号