Published in: International Journal of Metadata, Semantics and Ontologies

The impact of titles expansion based on ontology in document retrieval



Abstract

The logical structure of a document describes its components: chapters, sections, paragraphs, titles, chapter titles, and so on. Titles and subtitles are meaningful because they are good indicators of the content of the paragraphs they head, so they deserve particular attention during indexing and search. Title terms are the most important terms in the document, but titles are short and therefore contain very few terms, which leads to irrelevant results in information retrieval (IR). One possible solution is to expand titles by adding terms that are semantically similar to the original title terms. This work studies the effect of expanding the most important terms in documents on information retrieval. Experiments on a large corpus, INEX 2009, show that the approach is effective and improves the precision of IR results.
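To make the idea of title expansion concrete, here is a minimal Python sketch. It is not the authors' method: the abstract does not name the ontology or the similarity measure used, so `TOY_ONTOLOGY`, `expand_title`, and the `max_per_term` parameter are all hypothetical names introduced for illustration, and the related terms are hand-picked rather than derived from a real ontology.

```python
from typing import Dict, List

# Hypothetical toy "ontology": each term maps to hand-picked related terms.
# The ontology and similarity measure used in the paper are not given in
# the abstract, so this mapping is an assumption for illustration only.
TOY_ONTOLOGY: Dict[str, List[str]] = {
    "retrieval": ["search", "lookup"],
    "document": ["text", "article"],
    "ontology": ["taxonomy", "vocabulary"],
}


def expand_title(
    title: str,
    ontology: Dict[str, List[str]],
    max_per_term: int = 2,
) -> List[str]:
    """Return the title's terms followed by related terms, without duplicates."""
    terms = title.lower().split()
    # Keep the original terms first, in order, dropping repeats.
    expanded = list(dict.fromkeys(terms))
    seen = set(expanded)
    for term in terms:
        # Add at most max_per_term related terms for each title term.
        for related in ontology.get(term, [])[:max_per_term]:
            if related not in seen:
                expanded.append(related)
                seen.add(related)
    return expanded


expanded_terms = expand_title("Ontology document retrieval", TOY_ONTOLOGY)
```

In an indexing pipeline, the expanded term list would be indexed in place of (or alongside) the short original title, so that a query using a related word can still match the section the title heads. Capping the number of added terms per title term is one simple way to limit the noise that expansion can introduce.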
