首页> 外文会议>International Conference on Big Data and Smart Computing >An efficient parallel graph clustering technique using Pregel
【24h】

An efficient parallel graph clustering technique using Pregel

机译:使用Pregel的高效并行图聚类技术

获取原文

摘要

To represent complex structure, graph data is widely used in diverse applications. Among various techniques to extract meaningful information in graph data, graph clustering is an important task for the discovery of an underlying graph structure. However, the volume of graph data becomes large and increases fast recently as well as traditional clustering algorithms become computationally expensive as the size of data to be clustered increases. In this paper, we propose an efficient graph clustering algorithm running on Pregel which is one of prominent parallel processing models for large-scale graph data. Graph clustering technique merges the nodes into clusters such that the nodes in a cluster are strongly connected each other. To seek strongly connected nodes efficiently, we utilize Min-Hash which calculates similarity between vertices and/or clusters. In our experimental study, we demonstrate the efficiency and scalability of our parallel algorithm compared to existing algorithms.
机译:为了表示复杂的结构,图形数据被广泛用于各种应用程序中。在提取图数据中有意义的信息的各种技术中,图聚类是发现基础图结构的重要任务。但是,随着要聚类的数据量的增加,图数据的数量变得越来越大,并且增长迅速,并且传统的聚类算法在计算上也变得昂贵。在本文中,我们提出了一种在Pregel上运行的有效图聚类算法,该算法是针对大型图数据的著名并行处理模型之一。图聚类技术将节点合并到群集中,从而使群集中的节点彼此牢固地连接在一起。为了有效地寻找强连接的节点,我们利用Min-Hash来计算顶点和/或簇之间的相似度。在我们的实验研究中,我们证明了与现有算法相比,并行算法的效率和可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号