首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >Mongolian information retrieval method based on LDA model
【24h】

Mongolian information retrieval method based on LDA model

机译:基于LDA模型的蒙古信息检索方法

获取原文

摘要

A new method based on Latent Dirichlet Allocation (LDA) is proposed to retrieval information in Mongolian. Semantic information is also considered by Mongolian documents when consider relationship between keywords and retrieval documents. This method models Mongolian documents with LDA, parameters are estimated with Gibbs sampling and probability of word is represented, it can mine the hidden relationship between the different topics and the words from documents, get the topic distribution and compute the similarity of keywords topics. Finally, return to the most relevant documents with topics. Experimental results show that the method has a higher performance in topic semantic compared with vector space model and Language model.
机译:提出了一种基于潜在Dirichlet分配(LDA)的新方法,以在蒙古语中检索信息。蒙古文档也考虑在考虑关键字和检索文档之间的关系时,还考虑语义信息。此方法使用LDA模型蒙古文档,参数估计Gibbs采样和单词的概率表示,它可以在不同主题与文档之间的隐藏关系,获取主题分布并计算关键字主题的相似性。最后,用主题返回最相关的文件。实验结果表明,与矢量空间模型和语言模型相比,该方法在语义上具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号