首页> 外文会议>The Fourth International Conference on Systems Science and Systems Engineering (ICSSSE'03); Nov 25-28, 2003; Hong Kong SAR, China >IMPROVING THE VECTOR SPACE RETRIEVAL MODEL BY USING TERMS'S MODIFIERS AND CONSTRUCTING FUZZY SYNONYM THESAURUS
【24h】

IMPROVING THE VECTOR SPACE RETRIEVAL MODEL BY USING TERMS'S MODIFIERS AND CONSTRUCTING FUZZY SYNONYM THESAURUS

机译:利用术语修饰符和构建模糊语义同义词库改进矢量空间检索模型

获取原文
获取原文并翻译 | 示例

摘要

An information retrieval (IR) system can facilitate users to quickly and efficiently find out the documents that are relevant to the users' requirements. However, most current keyword-based IR methods often generate large trashes and miss important information since separated words/terms are involved in the retrieval process. Treating words/terras in the given query separately will lead to a number of irrelevant documents retrieved. To deal with this problem, we accordingly propose a new method in which the relationships among words are fully considered. For expanding the user's query, the fuzzy synonym thesaurus is built. Through the proposed method, we can retrieve relevant documents in a relatively narrow search space and meanwhile widen the coverage of the retrieval to the related documents that do not necessarily contain the same words as the query. As a result, the retrieving results obtained from the modified keyword-based IR method show some improvements in two metrics, precision and recall.
机译:信息检索(IR)系统可以帮助用户快速有效地查找与用户需求相关的文档。但是,由于检索过程中涉及单独的单词/术语,因此大多数当前基于关键字的IR方法通常会产生大量垃圾,并且会丢失重要信息。在给定查询中单独处理单词/ terras将导致检索到许多不相关的文档。为了解决这个问题,我们相应地提出了一种新方法,其中充分考虑了单词之间的关系。为了扩展用户的查询,建立了模糊同义词同义词库。通过提出的方法,我们可以在相对狭窄的搜索空间中检索相关文档,同时将检索范围扩大到不必包含与查询相同词的相关文档。结果,从改进的基于关键字的IR方法获得的检索结果显示出在两个指标(精度和查全率)方面的一些改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号