首页> 外文会议>International Workshop on Applied Parallel Computing: State of the Art in Scientific Computing(PARA 2006); 20060618-21; Umea(SE) >A Web-Site-Based Partitioning Technique for Reducing Preprocessing Overhead of Parallel PageRank Computation
【24h】

A Web-Site-Based Partitioning Technique for Reducing Preprocessing Overhead of Parallel PageRank Computation

机译:基于Web站点的分区技术,可减少并行PageRank计算的预处理开销

获取原文
获取原文并翻译 | 示例

摘要

A power method formulation, which efficiently handles the problem of dangling pages, is investigated for parallelization of PageRank computation. Hypergraph-partitioning-based sparse matrix partitioning methods can be successfully used for efficient parallelization. However, the preprocessing overhead due to hypergraph partitioning, which must be repeated often due to the evolving nature of the Web, is quite significant compared to the duration of the PageRank computation. To alleviate this problem, we utilize the information that sites form a natural clustering on pages to propose a site-based hypergraph-partitioning technique, which does not degrade the quality of the parallelization. We also propose an efficient parallelization scheme for matrix-vector multiplies in order to avoid possible communication due to the pages without in-links. Experimental results on realistic datasets validate the effectiveness of the proposed models.
机译:研究一种能有效处理页面晃动问题的幂方法公式,以实现PageRank计算的并行化。基于超图分区的稀疏矩阵分区方法可以成功地用于高效并行化。但是,由于超图分区的预处理开销(由于Web的不断发展的性质而必须经常重复)与PageRank计算的持续时间相比非常重要。为了缓解此问题,我们利用站点在页面上形成自然聚类的信息来提出基于站点的超图分区技术,该技术不会降低并行化的质量。我们还为矩阵矢量乘法提出了一种有效的并行化方案,以避免由于页面没有链接而可能进行的通信。在真实数据集上的实验结果验证了所提出模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号