首页> 外文期刊>Physica, A. Statistical mechanics and its applications >PSPLPA: Probability and similarity based parallel label propagation algorithm on spark
【24h】

PSPLPA: Probability and similarity based parallel label propagation algorithm on spark

机译:PSPLPA:基于概率和相似性并行标签传播算法

获取原文
获取原文并翻译 | 示例
       

摘要

With the rapid growth of social network, the cost of computation is increasing. Many existing algorithms are not suitable for the large-scale data. Apache Spark is an open source cluster computing framework that empowers us to solve the problem of community detection in a cluster of computer. In this paper, we propose a novel label propagation algorithm on Spark, called PSPLPA (Probability and similarity based Parallel label propagation algorithm). PSPLPA employs a new label updating strategy using probability in the label propagation procedure during each iteration. First, weight calculation, which is based on k-shell, is integrated into the label initialization process. Second, parallel propagation steps are comprehensively proposed to utilize label probability efficiently. Third, randomness in label updating is significantly reduced via automatic label selection and similarity computation. Experiments conducted on artificial and real social networks demonstrate that the proposed algorithm exhibits high scalability and high accuracy. (C) 2018 Elsevier B.V. All rights reserved.
机译:随着社交网络的快速增长,计算成本正在增加。许多现有算法不适用于大规模数据。 Apache Spark是一个开源集群计算框架,使我们能够解决一组计算机中的社区检测问题。在本文中,我们提出了一种关于火花的新标签传播算法,称为PSPLPA(基于概率和相似性的并行标签传播算法)。 PSPLPA在每次迭代期间使用标签传播过程中的概率使用新的标签更新策略。首先,基于K-shell的权重计算集成到标签初始化过程中。其次,综合地提出了并行传播步骤以有效地利用标签概率。第三,通过自动标签选择和相似性计算显着减少了标签更新中的随机性。在人造和真实社交网络上进行的实验表明,所提出的算法表现出高可扩展性和高精度。 (c)2018年elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号