首页> 中文期刊> 《计算机与现代化》 >面向水利信息资源目录服务的分布式语义检索方法研究

面向水利信息资源目录服务的分布式语义检索方法研究

         

摘要

针对水利信息资源目录服务中资源发现服务高查全率和实时性的需求,提出一种基于语义扩展的分布式元数据检索方法。该方法利用《水利公文主题词表》构建领域本体结合知网语义实现专业词汇与通用词汇的扩展,定义语义推理规则和词汇相关度,并结合推理机以支撑查询词汇的扩展;同时定义相似度阈值和选择方法防止“语义飘移”以保证检索查准率;采用语义相似度和文本相似度相结合的方式进行结果排序;基于MapReduce对索引创建和查询处理进行并行化改造提高检索的处理效率。%Addressing the demand of high recall rate and real-time for the resource discovery services in water information re-sources directory services, a distributed metadata retrieval method ( DSRM) based on semantic extension is proposed.It con-structs the domain ontology with“Official Document Thesaurus of Water Resources”, achieves the extension of the specialized vo-cabulary and the common vocabulary by combining with the semanteme based on Hownet, defines the semantic inference rules and the vocabulary correlativity, and supports the extension of the query vocabulary by combining with the inference machine.Mean-while, DSRM defines the threshold of similarity and the selection method, which can prevent the“semantic drift”, to ensure the searching accuracy.The searching results are ranked according to the combination of the semantic similarity and the text similari-ty.The MapReduce-based parallelization reform of the index creation and the query processing improves the processing efficiency of retrieval.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号