首页> 外文期刊>International journal of semantic computing >Exploring Symmetrical and Asymmetrical Dirichlet Priors for Latent Dirichlet Allocation
【24h】

Exploring Symmetrical and Asymmetrical Dirichlet Priors for Latent Dirichlet Allocation

机译:探索对称和不对称的Dirichlet Priors潜在的Dirichlet分配

获取原文
获取原文并翻译 | 示例
       

摘要

Latent Dirichlet Allocation (LDA) has gained much attention from researchers and is increasingly being applied to uncover underlying semantic structures from a variety of corpora. However, nearly all researchers use symmetrical Dirichlet priors, often unaware of the underlying practical implications that they bear. This research is the first to explore symmetrical and asymmetrical Dirichlet priors on topic coherence and human topic ranking when uncovering latent semantic structures from scientific research articles. More specifically, we examine the practical effects of several classes of Dirichlet priors on 2000 LDA models created from abstract and full-text research articles. Our results show that symmetrical or asymmetrical priors on the document–topic distribution or the topic–word distribution for full-text data have little effect on topic coherence scores and human topic ranking. In contrast, asymmetrical priors on the document–topic distribution for abstract data show a significant increase in topic coherence scores and improved human topic ranking compared to a symmetrical prior. Symmetrical or asymmetrical priors on the topic–word distribution show no real benefits for both abstract and full-text data.
机译:潜在的Dirichlet分配(LDA)从研究人员中获得了很多关注,并且越来越多地应用于从各种Corpora揭开潜在的语义结构。然而,几乎所有研究人员都使用对称的Dirichlet Priors,往往没有意识到他们承担的潜在的实际意义。该研究是第一个在揭示科学研究文章中脱离潜在语义结构时探索对称和​​不对称的Dirichlet Priore排名的对称和不对称的Dirichlet Priors。更具体地说,我们研究了来自抽象和全文研究文章创建的2000年LDA模型的几类Dirichlet Priors的实际效果。我们的结果表明,文档主题分布的对称或不对称前瞻性或全文数据的主题字分布几乎没有对主题一致性分数和人类主题排名的影响。相反,抽象数据的文件主题分布上的不对称前瞻性显示出主题相干评分的显着增加,并与对称先前相比,人类主题排名。主题字分布上的对称或不对称前瞻显示了抽象和全文数据的真正福利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号