首页> 外文会议>International Conference on Big Data and Smart Computing >An efficient parallel graph clustering technique using Pregel
【24h】

An efficient parallel graph clustering technique using Pregel

机译:一种使用PREGEL的有效的并行图形聚类技术

获取原文

摘要

To represent complex structure, graph data is widely used in diverse applications. Among various techniques to extract meaningful information in graph data, graph clustering is an important task for the discovery of an underlying graph structure. However, the volume of graph data becomes large and increases fast recently as well as traditional clustering algorithms become computationally expensive as the size of data to be clustered increases. In this paper, we propose an efficient graph clustering algorithm running on Pregel which is one of prominent parallel processing models for large-scale graph data. Graph clustering technique merges the nodes into clusters such that the nodes in a cluster are strongly connected each other. To seek strongly connected nodes efficiently, we utilize Min-Hash which calculates similarity between vertices and/or clusters. In our experimental study, we demonstrate the efficiency and scalability of our parallel algorithm compared to existing algorithms.
机译:表示复杂结构,图数据广泛用于各种应用。在图形数据中提取有意义信息的各种技术中,图形聚类是发现底层图形结构的重要任务。然而,图表数据的体积变大并且最近增加,以及传统的聚类算法随着要聚集的数据的大小增加而变得计算地昂贵。在本文中,我们提出了一种在PREGER上运行的高效图形聚类算法,该算法是大规模图数据的突出并行处理模型之一。图形聚类技术将节点合并到群集中,使得集群中的节点相互连接。为了有效地寻求强烈连接的节点,我们利用MIN-HASH,该散列计算顶点和/或簇之间的相似性。在我们的实验研究中,我们展示了与现有算法相比并行算法的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号