【24h】

Parallelizing the Computation of PageRank

机译:并行化PageRank的计算

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a technique we call ParaSolve that exploits the sparsity structure of the web graph matrix to improve on the degree of parallelism in a number of distributed approaches for computat-ing PageRank. Specifically, a typical algorithm (such as power iteration or GMRES) for solving the linear system corresponding to PageRank, call it LinearSolve, may be converted to a distributed algorithm, Dis-trib(LinearSolve), by partitioning the problem and applying a standard technique (i.e., Distrib). By reducing the number of inter-partition multiplications, we may greatly increase the degree of parallelism, while achieving a similar degree of accuracy. This should lead to increasingly better performance as we utilize more processors. For example, using GeoSolve (a variant of Jacobi iteration) as our linear solver and the 2001 web graph from Stanford's WebBase project, on 12 processors Para-Solve(GeoSolve) outperforms Distrib(GeoSolve) by a factor of 1.4, while on 32 processors the performance ratio improves to 2.8.
机译:本文介绍了一种称为ParaSolve的技术,该技术利用网络图矩阵的稀疏结构来提高许多用于计算PageRank的分布式方法的并行度。具体而言,可以通过对问题进行划分并应用标准技术,将用于求解与PageRank相对应的线性系统(称为LinearSolve)的典型算法(例如幂迭代或GMRES)转换为分布式算法Dis-trib(LinearSolve)。 (即分发)。通过减少分区间乘法的数量,我们可以大大提高并行度,同时达到类似的精度。随着我们使用更多处理器,这将导致性能越来越好。例如,使用GeoSolve(Jacobi迭代的一种形式)作为线性求解器,以及使用Stanford WebBase项目中的2001网络图形,在12个处理器上,Para-Solve(GeoSolve)的性能比Distrib(GeoSolve)高1.4倍,而在32个处理器上性能比提高到2.8。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号