首页> 外文期刊>Information Processing & Management >Integration of multiple evidences based on a query type for web search
【24h】

Integration of multiple evidences based on a query type for web search

机译:基于查询类型的多个证据的集成以进行Web搜索

获取原文
获取原文并翻译 | 示例
       

摘要

The massive and heterogeneous Web exacerbates IR problems and short user queries make them worse. The contents of web pages are not enough to find answer pages. PageRank compensates for the insufficiencies of content information. The content information and PageRank are combined to get better results. However, static combination of multiple evidences may lower the retrieval performance. We have to use different strategies to meet the need of a user. We can classify user queries as three categories according to users' intent, the topic relevance task, the homepage finding task, and the service finding task. In this paper, we present a user query classification method. The difference of distribution, mutual information, the usage rate as anchor texts and the POS information are used for the classification. After we classified a user query, we apply different algorithms and information for the better results. For the topic relevance task, we emphasize the content information, on the other hand, for the homepage finding task, we emphasize the Link information and the URL information. We could get the best performance when our proposed classification method with the OKAPI scoring algorithm was used.
机译:庞大且异构的Web加剧了IR问题,而用户查询时间短使它们变得更糟。网页的内容不足以找到答案页面。 PageRank弥补了内容信息的不足。内容信息和PageRank结合在一起可获得更好的结果。但是,多个证据的静态组合可能会降低检索性能。我们必须使用不同的策略来满足用户的需求。我们可以根据用户的意图,主题相关性任务,主页查找任务和服务查找任务将用户查询分为三类。在本文中,我们提出了一种用户查询分类方法。分配的差异,相互信息,作为锚文本的使用率和POS信息用于分类。在对用户查询进行分类之后,我们将应用不同的算法和信息以获得更好的结果。对于主题相关性任务,我们强调内容信息,而对于首页查找任务,我们强调链接信息和URL信息。当使用我们提出的带有OKAPI评分算法的分类方法时,我们可以获得最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号