首页> 外文会议> >An investigation of the TREC Web track datasets based on the hyperlink analysis algorithm
【24h】

An investigation of the TREC Web track datasets based on the hyperlink analysis algorithm

机译:基于超链接分析算法的TREC Web轨道数据集研究

获取原文

摘要

One of the main aims of TREC (text retrieval conference) Web track has been to answer the question if link-based methods are better than keyword-based methods for Web search, but most of the participations including us find that the hyperlink structure cannot improve search effectiveness as some commercial search engine claimed. This paper tries to find the reason by investigating the WT10G, the .GOV dataset, the answer-sets and the TREC evaluation measure. We propose our assumption about the link-based methods and prove its correction in these two datasets. How to get a better result by the link-based methods was found in TREC datasets by some experiments. Some suggestions for the TREC datasets collection and evaluation measure are also given this paper.
机译:TREC(文本检索会议)Web跟踪的主要目的之一是回答以下问题:基于链接的方法是否优于基于关键字的Web搜索方法,但是包括我们在内的大多数参与者都发现超链接结构无法改善某些商业搜索引擎声称的搜索效果。本文试图通过调查WT10G,.GOV数据集,答案集和TREC评估措施来找到原因。我们提出关于基于链接的方法的假设,并在这两个数据集中证明其正确性。通过一些实验,在TREC数据集中发现了如何通过基于链接的方法获得更好的结果。本文还针对TREC数据集的收集和评估措施提出了一些建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号