首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Topic-sensitive PageRank: a context-sensitive ranking algorithm for Web search
【24h】

Topic-sensitive PageRank: a context-sensitive ranking algorithm for Web search

机译:主题敏感的PageRank:用于Web搜索的上下文相关的排名算法

获取原文
获取原文并翻译 | 示例

摘要

The original PageRank algorithm for improving the ranking of search-query results computes a single vector, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared. By using linear combinations of these (precomputed) biased PageRank vectors to generate context-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. We describe techniques for efficiently implementing a large-scale search system based on the topic-sensitive PageRank scheme.
机译:用于提高搜索查询结果排名的原始PageRank算法使用Web的链接结构来计算单个向量,以捕获Web页面的相对“重要性”,而与任何特定的搜索查询无关。为了产生更准确的搜索结果,我们建议计算一组PageRank向量,并使用一组代表性主题进行偏置,以更准确地捕获相对于特定主题的重要性概念。对于普通的关键字搜索查询,我们使用查询关键字的主题为满足查询条件的页面计算主题敏感的PageRank分数。对于在上下文中进行的搜索(例如,当通过突出显示网页中的单词来执行搜索查询时),我们使用查询出现的上下文的主题来计算主题敏感的PageRank分数。通过使用这些(预先计算的)有偏差的PageRank向量的线性组合来生成查询时页面的特定于上下文的重要性得分,我们证明了与使用单个通用的PageRank向量相比,我们可以生成更准确的排名。我们描述了基于主题敏感的PageRank方案有效实现大型搜索系统的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号