Conference proceedings: Advances in Information Retrieval

The Importance of Link Evidence in Wikipedia

Abstract

Wikipedia is one of the most popular information sources on the Web. The free encyclopedia is densely linked. The link structure in Wikipedia differs from that of the Web at large: internal links in Wikipedia are typically based on words naturally occurring in a page, and link to another semantically related entry. Our main aim is to find out whether Wikipedia's link structure can be exploited to improve ad hoc information retrieval. We first analyse the relation between Wikipedia links and the relevance of pages. We then experiment with the use of link evidence in the focused retrieval of Wikipedia content, based on the test collection of INEX 2006. Our main findings are as follows. First, our analysis reveals that the Wikipedia link structure is a (possibly weak) indicator of relevance. Second, our experiments on INEX ad hoc retrieval tasks reveal that, if the link evidence is made sensitive to the local context, we see a significant improvement in retrieval effectiveness. Hence, in contrast with earlier TREC experiments using crawled Web data, we have shown that Wikipedia's link structure can help improve the effectiveness of ad hoc retrieval.