...
【24h】

Aggregated topic models for increasing social media topic coherence

机译:增加社交媒体主题一致性的聚合主题模型

获取原文
获取原文并翻译 | 示例
           

摘要

This research presents a novel aggregating method for constructing an aggregated topic model that is composed of the topics with greater coherence than individual models. When generating a topic model, a number of parameters have to be specified. The resulting topics can be very general or very specific, which depend on the chosen parameters. In this study we investigate the process of aggregating multiple topic models generated using different parameters with a focus on whether combining the general and specific topics is able to increase topic coherence. We employ cosine similarity and Jensen-Shannon divergence to compute the similarity among topics and combine them into an aggregated model when their similarity scores exceed a predefined threshold. The model is evaluated against the standard topics models generated by the latent Dirichlet allocation and Non-negative Matrix Factorisation. Specifically we use the coherence of topics to compare the individual models that create aggregated models against those of the aggregated model and models generated by Non-negative Matrix Factorisation, respectively. The results demonstrate that the aggregated model outperforms those topic models at a statistically significant level in terms of topic coherence over an external corpus. We also make use of the aggregated topic model on social media data to validate the method in a realistic scenario and find that again it outperforms individual topic models.
机译:该研究提出了一种新的聚合方法,用于构建由具有比单个模型更强的主题组成的汇总主题模型。生成主题模型时,必须指定许多参数。结果主题可以是非常一般的或非常特定的,这取决于所选参数。在这项研究中,我们调查使用不同参数生成的多主题模型的过程,重点是组合一般和特定主题是否能够增加主题连贯性。我们使用余弦相似性和Jensen-Shannon发散来计算主题之间的相似性,并在其相似度得分超过预定阈值时将它们组合成聚合模型。根据潜在的Dirichlet分配和非负矩阵分子生成的标准主题模型来评估该模型。具体而言,我们使用主题的一致性来比较分别为非负矩阵分子生成的聚合模型和模型创建聚合模型的各个模型。结果表明,聚合模型在外部语料库上的主题一致性方面以统计学上显着的级别优于这些主题模型。我们还在社交媒体数据上利用聚合主题模型,以验证在现实方案中的方法,并再次发现它优于个体主题模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号