首页> 外文期刊>Information Systems >Modeling the web as a hypergraph to compute page reputation
【24h】

Modeling the web as a hypergraph to compute page reputation

机译:将网络建模为超图以计算页面信誉

获取原文
获取原文并翻译 | 示例
       

摘要

In this work we propose a model to represent the web as a directed hypergraph (instead of a graph), where links connect pairs of disjointed sets of pages. The web hypergraph is derived from the web graph by dividing the set of pages into non-overlapping blocks and using the links between pages of distinct blocks to create hyperarcs. A hyperarc connects a block of pages to a single page, in order to provide more reliable information for link analysis. We use the hypergraph model to create the hypergraph versions of the Pagerank and Indegree algorithms, referred to as HyperPagerank and Hyperlndegree, respectively. The hypergraph is derived from the web graph by grouping pages by two different partition criteria: grouping together the pages that belong to the same web host or to the same web domain. We compared the original page-based algorithms with the host-based and domain-based versions of the algorithms, considering a combination of the page reputation, the textual content of the pages and the anchor text. Experimental results using three distinct web collections show that the HyperPagerank and Hyperlndegree algorithms may yield better results than the original graph versions of the Pagerank and Indegree algorithms. We also show that the hypergraph versions of the algorithms were slightly less affected by noise links and spamming.
机译:在这项工作中,我们提出了一个模型来将网络表示为有向超图(而不是图),其中链接连接成对的不相交页面集。通过将页面集划分为不重叠的块,并使用不同块的页面之间的链接来创建超弧,可以从网络图派生网络超图。超弧将页面块连接到单个页面,以便为链接分析提供更可靠的信息。我们使用超图模型来创建Pagerank和Indegree算法的超图版本,分别称为HyperPagerank和Hyperlndegree。超图是通过以下两种不同的分区标准对页面进行分组而从Web图形中得出的:将属于同一Web主机或同一Web域的页面分组在一起。考虑到页面信誉,页面文本内容和锚文本的组合,我们将原始的基于页面的算法与基于主机和基于域的算法进行了比较。使用三个不同的Web集合的实验结果表明,与Pagerank和Indegree算法的原始图形版本相比,HyperPagerank和HyperIndegree算法可能会产生更好的结果。我们还显示,该算法的超图版本受噪声链接和垃圾邮件的影响稍小。

著录项

  • 来源
    《Information Systems》 |2010年第5期|p.530-543|共14页
  • 作者单位

    Department of Computer Science, Federal University of Amazonas, Manaus, Brazil;

    rnDepartment of Computer Science, Federal University of Amazonas, Manaus, Brazil;

    rnDepartment of Computer Science, Federal University of Amazonas, Manaus, Brazil;

    FUCAPI, Analysis, Research and Tech, Innovation Center, Manaus, Brazil;

    rnDepartment of Computer Science, Federal University of Minas Gerais, Beta Horizonte, Brazil;

    rnInstitute of Informatics, Federal University of Goias, Goiania, Brazil;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    link analysis; web search;

    机译:链接分析;网络搜索;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号