首页> 外文期刊>Expert Systems with Application >A parallel fuzzy clustering algorithm for large graphs using Pregel
【24h】

A parallel fuzzy clustering algorithm for large graphs using Pregel

机译:使用Pregel的大型图的并行模糊聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

Large graphs are scale free and ubiquitous having irregular relationships. Clustering is used to find existent similar patterns in graphs and thus help in getting useful insights. In real-world, nodes may belong to more than one cluster thus, it is essential to analyze fuzzy cluster membership of nodes. Traditional centralized fuzzy clustering algorithms incur high communication cost and produce poor quality of clusters when used for large graphs. Thus, scalable solutions are obligatory to handle huge amount of data in less computational time with minimum disk access. In this paper, we proposed a parallel fuzzy clustering algorithm named 'PGFC' for handling scalable graph data. It will be advantageous from the viewpoint of expert systems to develop a clustering algorithm that can assure scalability along with better quality of clusters for handling large graphs.The algorithm is parallelized using bulk synchronous parallel (BSP) based Pregel model. The cluster centers are initialized using degree centrality measure, resulting in lesser number of iterations. The performance of PGFC is compared with other state of art clustering algorithms using synthetic graphs and real world networks. The experimental results reveal that the proposed PGFC scales up linearly to handle large graphs and produces better quality of clusters when compared to other graph clustering counterparts. (C) 2017 Elsevier Ltd. All rights reserved.
机译:大图是无标度的,并且普遍存在不规则关系。聚类用于查找图形中存在的相似模式,从而有助于获得有用的见解。在现实世界中,节点可能属于多个群集,因此,分析节点的模糊群集成员关系至关重要。传统的集中式模糊聚类算法在用于大型图时会产生较高的通信成本,并且会导致聚类质量下降。因此,可伸缩的解决方案必须以最少的磁盘访问量在更少的计算时间内处理大量数据。在本文中,我们提出了一种并行模糊聚类算法,称为“ PGFC”,用于处理可伸缩图数据。从专家系统的角度来看,开发一种聚类算法将是有利的,该算法可确保可伸缩性以及用于处理大型图形的聚类的更好质量。该算法使用基于批量同步并行(BSP)的Pregel模型进行并行化。使用度中心性度量初始化聚类中心,从而减少迭代次数。使用合成图和真实世界网络将PGFC的性能与其他现有技术的聚类算法进行比较。实验结果表明,与其他图聚类对应物相比,提出的PGFC可以线性放大以处理大型图,并产生更好的聚类质量。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号