首页> 外文期刊>ACM transactions on Asian language information processing >Chinese Information Retrieval Based on Terms and Relevant Terms
【24h】

Chinese Information Retrieval Based on Terms and Relevant Terms

机译:基于术语和相关术语的中文信息检索

获取原文
获取原文并翻译 | 示例
       

摘要

In this article we describe our approach to Chinese information retrieval, where a query is a short natural language description. First, we use automatically extracted short terms from document sets to build indexes and use the short terms in both the query and documents to do initial retrieval. Next, we use long terms extracted from the document collection to reorder the top N retrieved documents to improve precision. Finally, we acquire the relevant terms of the short terms from the Internet and the top retrieved documents and use them to do query expansion. Experiments on the NTCIR-4 CLIR Chinese SLIR sub-collection show that document reranking can both improve the retrieval performance on its own and make a significant contribution to query expansion. The experiments also show that the extended query expansion proposed in this article is more effective than the standard Rocchio query expansion.
机译:在本文中,我们描述了我们的中文信息检索方法,其中查询是一种简短的自然语言描述。首先,我们使用从文档集中自动提取的短期词汇来建立索引,并在查询和文档中都使用短期词汇进行初始检索。接下来,我们使用从文档集合中提取的长期术语对前N个检索到的文档进行重新排序,以提高准确性。最后,我们从Internet和检索到的最热门文档中获取短期术语的相关术语,并使用它们进行查询扩展。在NTCIR-4 CLIR中文SLIR子集合上进行的实验表明,文档重新排序不仅可以提高检索性能,而且可以为查询扩展做出重大贡献。实验还表明,本文提出的扩展查询扩展比标准Rocchio查询扩展更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号