首页> 外文会议>International computer science and engineering conference >An algorithm to improve MPI-PageRank performance by reducing synchronization time
【24h】

An algorithm to improve MPI-PageRank performance by reducing synchronization time

机译:通过减少同步时间来提高MPI-PageRank性能的算法

获取原文

摘要

Ranking is an important operation in web searching. Among many ranking algorithms, PageRank is a most notable one. However, sequential PageRank computing on a large web-link graph is not efficient. To address such limitation, parallel PageRank implemented on Message Passing Interface (MPI) is a viable choice. Generally speaking, MPI-PageRank will be implemented using a root node and many computing, i.e., child, nodes. In each PageRank iteration, root node will partition web-link graph and distribute to child nodes. Then, each child node will perform PageRank on its partial web-link graph. Next, child nodes will send the result back to be combined at the root node. This operation will be performed iteratively before the ranking is converged. From the observation that when the number of nodes increase, the time to communicate between root and child nodes, i.e., synchronization time, increases rapidly such that it overcomes the benefit of parallel computing. This paper proposed an algorithm to reduce such time with a trade-off on ranking accuracy. The evaluation result show that the proposed algorithm can improve performance in term of the execution time with a bargain of accuracy.
机译:排名是网络搜索中的一项重要操作。在许多排名算法中,PageRank是最著名的算法。但是,在大型Web链接图上进行连续的PageRank计算效率不高。为了解决这种限制,在消息传递接口(MPI)上实现并行PageRank是一个可行的选择。一般来说,MPI-PageRank将使用根节点和许多计算节点(即子节点)来实现。在每个PageRank迭代中,根节点将对Web链接图进行分区并分配给子节点。然后,每个子节点将对其部分Web链接图执行PageRank。接下来,子节点将结果发送回去,以便在根节点上进行合并。在收敛排名之前,将迭代执行此操作。从观察到,当节点数量增加时,根节点与子节点之间进行通信的时间(即同步时间)迅速增加,从而克服了并行计算的优势。本文提出了一种在排序精度上进行权衡的减少时间的算法。评估结果表明,所提算法可以在执行时间方面提高性能,且价格合理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号