首页> 外文会议>Advances in Knowledge Discovery and Data Mining >An Efficient Algorithm for Incremental Update of Concept Spaces
【24h】

An Efficient Algorithm for Incremental Update of Concept Spaces

机译:一种有效的概念空间增量更新算法

获取原文

摘要

The vocabulary problem in information retrieval arises because authors and indexers often use different terms for the same concept. A thesaurus defines mappings between different but related terms. It is widely used in modern information retrieval systems to solve the vocabulary problem. Chen et al. proposed the concept space approach to automatic thesaurus construction. A concept space contains the associations between every pair of terms. Previous research studies show that concept space is a useful tool for helping information searchers in revising their queries in order to get better results from information retrieval systems. The construction of a concept space, however, is very computationally intensive. In this paper, we propose and evaluate an efficient algorithm for the incremental update of concept spaces. In our model, only strong associations are maintained, since they are most useful in thesauri construction. Our algorithm uses a pruning technique to avoid computing weak associations to achieve efficiency.
机译:信息检索中的词汇问题之所以出现,是因为作者和索引者经常对同一概念使用不同的术语。同义词库定义了不同但相关的术语之间的映射。它广泛用于现代信息检索系统中,以解决词汇问题。 Chen等。提出了自动同义词库构建的概念空间方法。概念空间包含每对术语之间的关联。先前的研究表明,概念空间是帮助信息搜索者修改其查询以从信息检索系统获得更好结果的有用工具。但是,概念空间的构建需要大量的计算。在本文中,我们提出并评估了一种有效的概念空间增量更新算法。在我们的模型中,仅保留了强关联,因为它们在叙词表构建中最有用。我们的算法使用修剪技术来避免计算弱关联以获得效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号