【24h】

VIPAS: Virtual Link Powered Authority Search in the Web

机译:VIPAS:在网络中通过虚拟链接提供动力的授权机构搜索

获取原文
获取原文并翻译 | 示例

摘要

With the exponential growth of the World Wide Web, looking for pages with high quality and relevance in the Web has become an important research field. There have been many keyword-based search engines built for this purpose. However, these search engines usually suffer from the problem that a relevant Web page may not contain the keyword in its page text. Algorithms exploiting the link structure of Web documents, such as HITS, have also been proposed to overcome the problems of traditional search engines. Though these algorithms perform better than keyword-based search engines, they still have some defects. Among others, one major problem is that links in Web pages are only able to reflect the view of the page authors on the topic of those pages but not that of the page readers. In this paper, we propose a new algorithm with the idea of using virtual links which are created according to what the user behaves in browsing the output list of the query result. These virtual links are then employed to identify authoritative resources in the Web. Specifically, the algorithm, referred to as algorithm VIPAS (standing for virtual link powered authority search), is divided into three phases. The first phase performs basic link analysis. The second phase collects statistics by observing the user behavior in browsing pages listed in the query result, and virtual links are then created according to what observed. In the third phase, these virtual links as well as real ones are taken together to produce an updated list of authoritative pages that will be presented to the user when the query with similar keywords is encountered next time. A Web warehouse is built and the algorithm is integrated into the system. By conducting experiments on the system, we have shown that VIPAS is not only very effective but also very adaptive in providing much more valuable information to users.
机译:随着万维网的指数增长,在网络中寻找高质量和相关的页面已成为重要的研究领域。为此已经建立了许多基于关键字的搜索引擎。但是,这些搜索引擎通常遭受以下问题:相关网页的页面文本中可能不包含关键字。还提出了利用Web文档链接结构的算法,例如HITS,以克服传统搜索引擎的问题。尽管这些算法的性能比基于关键字的搜索引擎更好,但它们仍然存在一些缺陷。其中一个主要问题是,Web页面中的链接只能反映页面作者对那些页面主题的看法,而不能反映页面阅读者的观点。在本文中,我们提出了一种新算法,其使用虚拟链接的想法是根据用户在浏览查询结果的输出列表时的行为创建的。然后使用这些虚拟链接来标识Web中的权威资源。具体而言,称为算法VIPAS(代表虚拟链接供电的授权搜索)的算法分为三个阶段。第一阶段执行基本链接分析。第二阶段通过观察用户在浏览查询结果中列出的页面时的行为来收集统计信息,然后根据观察到的内容创建虚拟链接。在第三阶段中,将这些虚拟链接和真实链接合并在一起,以生成权威页面的更新列表,当下次遇到具有类似关键字的查询时,这些页面将呈现给用户。建立一个Web仓库,并将算法集成到系统中。通过在系统上进行实验,我们表明VIPAS在向用户提供更多有价值的信息方面不仅非常有效,而且也非常适应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号