首页> 外文期刊>Procedia Computer Science >G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering
【24h】

G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering

机译:G-DBSCAN:基于密度的群集的GPU加速算法

获取原文
           

摘要

With the advent of Web 2.0, we see a new and differentiated scenario: there is more data than that can be effectively analyzed. Organizing this data has become one of the biggest problems in Computer Science. Many algorithms have been proposed for this purpose, highlighting those related to the Data Mining area, specifically the clustering algorithms. However, these algo- rithms are still a computational challenge because of the volume of data that needs to be processed. We found in the literature some proposals to make these algorithms feasible, and, recently, those related to parallelization on graphics processing units (GPUs) have presented good results. In this work we present the G-DBSCAN, a GPU parallel version of one of the most widely used clustering algorithms, the DBSCAN. Although there are other parallel versions of this algorithm, our technique distinguishes itself by the simplicity with which the data are indexed, using graphs, allowing various parallelization opportu- nities to be explored. In our evaluation we show that the G-DBSCAN using GPU, can be over 100xfaster than its sequential version using CPU.
机译:随着Web 2.0的出现,我们看到了一种新的与众不同的方案:存在的数据量超出了可以有效分析的范围。组织这些数据已成为计算机科学中最大的问题之一。为此,已经提出了许多算法,重点介绍了与数据挖掘领域有关的算法,尤其是聚类算法。但是,由于需要处理的数据量很大,因此这些算法仍然是计算难题。我们在文献中发现了使这些算法可行的一些建议,并且最近,与图形处理单元(GPU)上的并行化相关的那些建议已显示出良好的效果。在这项工作中,我们介绍了G-DBSCAN,这是最广泛使用的集群算法之一DBSCAN的GPU并行版本。尽管该算法还有其他并行版本,但我们的技术通过使用图形对数据进行索引的简单性来区分自己,从而可以探索各种并行化机会。在我们的评估中,我们表明使用GPU的G-DBSCAN可以比使用CPU的顺序版本快100倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号