首页> 中文期刊> 《新疆农业大学学报》 >基于双重语义空间的农业信息检索模型研究

基于双重语义空间的农业信息检索模型研究

         

摘要

为了提高针对大规模农业信息的语义检索性能,提出一种基于改进的随机索引语义空间和潜在语义空间的农业信息检索模型(IRI&LSA).利用120万张中文网页和2 000张分为4类的小规模中文农业网页,对IRI&LSA和两种分别基于单向量兰克泽斯算法(LAS2)和半离散矩阵分解算法(SDD)的常用潜在语义检索模型(LSA-LAS2和LSA-SDD)进行了对比实验.结果表明,IRI&LSA检索结果的平均F1值可达83%,明显高于LSA-LAS2(71%)和LSA-SDD(64%);IRI&LSA的检索速度分别是LSA-LAS2和LSA-SDD的3.6倍和4.9倍.研究结果表明,IRI& LSA适合应用于较大规模农业信息检索.%In order to improve semantic retrieval function of massive agricultural information,an agricultural information search modle (IRI&.LSA) was proposed,based on improved radom index semantic space and latent sematic space. The contrast experiments were conducted between IRI&LSA and two commonly used latent semantic models (LSA-LAS2 and LSA-SDD) by using 1. 2 million Chinese web pages and 2 000 Chinese agricultural web pages that were divided into four categories,based on single-vector lanczos algorithm (LAS2) and semi-discrete matrix decomposition algorithm (SDD) respectively. These results showed that the average Fl value of search results of IRIcV-LSA reached 83% that was significantly higher than LSA-LAS2(71%) and LSA-SDD(64%); retrieval speed of IRI&LSA was LSA-LAS2's 3. 6 times and LSA-SDD's 4. 9 times. Experimental results showed that IRI&-LSA was suitable for massive agricultural information retrieval.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号