Foreign-language conference: Web Information Systems and Mining

Conceptual Representing of Documents and Query Expansion Based on Ontology


Abstract

In the vector space model, a document is represented by its words. Because new words appear rapidly in the Internet era, this representation degrades the performance of IR systems. This paper puts forward a new approach to representing concepts, query expressions, and documents based on an ontology. The approach has two levels: the Word-Concept level and the Concept-Document level. At the first level, a transition probability matrix is constructed from the number of times word-word pairs appear in documents. The principal eigenvector of this matrix (the one associated with its largest eigenvalue) is computed, and it reflects the importance of words to the concept. At the second level, a distance matrix is constructed from the distances between words in a given ontology, and the average variance of its elements is computed; this value reflects the relevance of documents to concepts. The last section discusses query expansion using the user's personal information profile. The approach is shown to be more effective than the previous one.
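The Word-Concept step can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not confirm: a word-word pair is counted once per document in which both words occur, the transition matrix is obtained by row-normalizing those counts, and the "biggest eigenvector" is taken to be the dominant left eigenvector (the stationary distribution), found by power iteration. The function names `cooccurrence_counts`, `transition_matrix`, and `principal_eigenvector` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents):
    """Count, for each ordered word pair, the documents containing both words."""
    counts = Counter()
    for doc in documents:
        # Assumption: a pair is counted once per document, not once per occurrence.
        for a, b in combinations(sorted(set(doc)), 2):
            counts[(a, b)] += 1
            counts[(b, a)] += 1
    return counts


def transition_matrix(vocab, counts):
    """Row-normalize pair counts so that each row is a probability distribution."""
    matrix = []
    for a in vocab:
        row = [counts.get((a, b), 0) for b in vocab]
        total = sum(row)
        if total == 0:
            # Assumption: a word with no co-occurrences moves uniformly to any word.
            row = [1.0 / len(vocab)] * len(vocab)
        else:
            row = [c / total for c in row]
        matrix.append(row)
    return matrix


def principal_eigenvector(matrix, iterations=200, tol=1e-12):
    """Power iteration for the dominant left eigenvector v = vP, scaled to sum to 1."""
    n = len(matrix)
    v = [1.0 / n] * n
    for _ in range(iterations):
        nxt = [sum(v[i] * matrix[i][j] for i in range(n)) for j in range(n)]
        scale = sum(nxt)
        nxt = [x / scale for x in nxt]
        if max(abs(x - y) for x, y in zip(nxt, v)) < tol:
            return nxt
        v = nxt
    return v


# Toy corpus: each document is a list of already-tokenized words.
docs = [
    ["ontology", "concept", "query"],
    ["ontology", "concept"],
    ["query", "expansion"],
]
vocab = sorted({w for d in docs for w in d})
P = transition_matrix(vocab, cooccurrence_counts(docs))
scores = dict(zip(vocab, principal_eigenvector(P)))
```

On this toy corpus the scores come out at roughly 0.3 for `concept`, `ontology`, and `query` and 0.1 for `expansion`, so the word that co-occurs least is ranked as least important to the concept. Power iteration is used here only to keep the sketch dependency-free; it converges when the co-occurrence graph is connected and aperiodic, which holds for this example but is not guaranteed in general.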
