首页> 外文会议>ACM international conference on information and knowledge management >Contextualization using Hyperlinks and Internal Hierarchical Structure of Wikipedia Documents
【24h】

Contextualization using Hyperlinks and Internal Hierarchical Structure of Wikipedia Documents

机译:使用超链接和Wikipedia文档的内部层次结构进行上下文化

获取原文

摘要

Context surrounding hyperlinked semi-structured documents, externally in the form of citations and internally in the form of hierarchical structure, contains a wealth of useful but implicit evidence about a document's relevance. These rich sources of information should be exploited as contextual evidence. This paper proposes various methods of accumulating evidence from the context, and measures the effect of contextual evidence on retrieval effectiveness for document and focused retrieval of hyperlinked semi-structured documents. We propose a re-weighting model to contextualize (a) evidence from citations in a query-independent and query-dependent fashion (based on Markovian random walks) and (b) evidence accumulated from the internal tree structure of documents. The in-links and out-links of a node in the citation graph are used as external context, while the internal document structure provides internal, within-document context. We hypothesize that documents in a good context (having strong contextual evidence) should be good candidates to be relevant to the posed query, and vice versa. We tested several variants of contextualization and verified notable improvements in comparison with the baseline system and gold standards in the retrieval of full documents and focused elements.
机译:围绕超链接的半结构文档的上下文(在外部以引文的形式在内部以层次结构的形式在内部)包含大量有用但隐含的关于​​文档相关性的证据。这些丰富的信息资源应作为背景证据加以利用。本文提出了多种从上下文中收集证据的方法,并测量了上下文证据对文档检索效率和超链接半结构化文档的集中检索的影响。我们提出一种重新加权模型,以将(a)来自引用的查询以独立于查询和依赖于查询的方式(基于马尔可夫随机游动)和(b)从文档的内部树结构中积累的证据进行上下文关联。引用图中节点的入站和出站链接用作外部上下文,而内部文档结构则提供内部文档内部上下文。我们假设在良好上下文中(具有强大的上下文证据)的文档应该是与提出的查询相关的良好候选者,反之亦然。我们测试了多种语境化变体,并验证了与基线系统和黄金标准相比,在检索完整文档和重点内容方面的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号