【24h】

Parallel PageRank Algorithm Using MapReduce

机译:使用MapReduce的并行PageRank算法

获取原文

摘要

PageRank algorithm is used to determine the relevance of a web page based on the user query. It is among one of the most effective algorithms for working out on the relevance of website pages across the globe. In the world of computation, the web is a large dataset represented in the structure as a graph with billions of hubs and links that represent various features of the web page. It is difficult to find the relevance of pages and takes a large amount of time as it uses a huge number of calculations. Using basic PageRank algorithm, causes the main problem to measure the PageRank of the web page, to evaluate this problem the algorithm can be parallelized to achieve higher efficiency in terms of time, speed and accuracy. This basic algorithm of PageRank can be implemented using MapReduce in Hadoop Framework, which results in the Parallel PageRank algorithm using MapReduce works efficiently in terms of time, speed and accuracy. Our experimental results performed on different clusters deliveries higher efficiency.
机译:PageRank算法用于根据用户查询确定网页的相关性。它是解决全球网站页面相关性的最有效算法之一。在计算世界中,Web是一个大型数据集,在结构中以图表的形式表示,其中有数十亿个代表网页各种功能的集线器和链接。很难找到页面的相关性,并且要花费大量的时间,因为它需要使用大量的计算。使用基本的PageRank算法,导致主要问题是测量网页的PageRank,为了评估该问题,可以并行化算法以在时间,速度和准确性方面实现更高的效率。 PageRank的这种基本算法可以在Hadoop框架中使用MapReduce来实现,这导致使用MapReduce的并行PageRank算法在时间,速度和准确性方面均有效地工作。我们在不同集群上执行的实验结果提供了更高的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号