首页> 外文期刊>International journal of data mining, modelling and management >Fast parallel PageRank technique for detecting spam web pages
【24h】

Fast parallel PageRank technique for detecting spam web pages

机译:用于检测垃圾网页的快速并行PageRank技术

获取原文
获取原文并翻译 | 示例
           

摘要

Brin and Larry proposed PageRank in 1998, which appears as a prevailing link analysis technique used by web search engines to rank its search results list. Computation of PageRank values in an efficient and faster manner for very immense web graph is truly an essential concern for search engines today. To identify the spam web pages and also deal with them is yet another important concern in web browsing. In this research article, an efficient and faster parallel PageRank algorithm is proposed, which harnesses the power of graphics processing units (GPUs). In proposed algorithm, the PageRank scores are non-uniformly distributes among the web pages, so it is also competent of coping with spam web pages. The experiments are performed on standard datasets available in Stanford large network dataset collection. There is a speed up of about 1.1 to 1.7 for proposed parallel PageRank algorithm over existing parallel PageRank algorithm.
机译:布林和拉里(Brin and Larry)于1998年提出PageRank,这是网络搜索引擎用来对其搜索结果列表进行排名的一种流行的链接分析技术。对于当今非常庞大的网络图,以高效,快速的方式计算PageRank值是当今搜索引擎必不可少的问题。识别垃圾邮件网页并对其进行处理是Web浏览中的另一个重要问题。在这篇研究文章中,提出了一种有效且更快的并行PageRank算法,该算法利用了图形处理单元(GPU)的功能。在提出的算法中,PageRank分数在网页之间是不均匀分布的,因此它也具有处理垃圾网页的能力。实验是在斯坦福大型网络数据集集合中可用的标准数据集上进行的。与现有的并行PageRank算法相比,建议的并行PageRank算法的速度提高了约1.1到1.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号