首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >Mongolian information retrieval method based on LDA model
【24h】

Mongolian information retrieval method based on LDA model

机译:基于LDA模型的蒙古语信息检索方法

获取原文

摘要

A new method based on Latent Dirichlet Allocation (LDA) is proposed to retrieval information in Mongolian. Semantic information is also considered by Mongolian documents when consider relationship between keywords and retrieval documents. This method models Mongolian documents with LDA, parameters are estimated with Gibbs sampling and probability of word is represented, it can mine the hidden relationship between the different topics and the words from documents, get the topic distribution and compute the similarity of keywords topics. Finally, return to the most relevant documents with topics. Experimental results show that the method has a higher performance in topic semantic compared with vector space model and Language model.
机译:提出了一种基于潜在狄利克雷分配(LDA)的蒙古文信息检索新方法。当考虑关键词和检索文档之间的关系时,蒙古文文档也会考虑语义信息。该方法利用LDA对蒙古文文档进行建模,用Gibbs采样估计参数,并表示单词的概率,可以挖掘出文档中不同主题与单词之间的隐蔽关系,得到主题分布并计算关键词主题的相似度。最后,返回与主题最相关的文档。实验结果表明,与向量空间模型和语言模型相比,该方法在主题语义上具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号