
Targeted aspects oriented topic modeling for short texts



Abstract

Topic modeling has demonstrated its value in short text topic discovery. For this task, many topic models perform a full analysis to find all possible topics. However, these models overlook the importance of deeper topics, so the topics they discover are often confusing. In practice, people usually look for more focused topics on particular aspects (or events) rather than a set of coarse topics. Therefore, in this paper, we propose a novel method, Targeted Aspects Oriented Topic Modeling (TATM), to discover more focused topics on specific aspects in short texts. Specifically, each short text is assigned to exactly one targeted aspect derived from an enhanced Dirichlet Multinomial Mixture process (E-DMM). This process groups as many similar words as possible, which achieves topic homogeneity. In addition, TATM discovers the topics for each targeted aspect from as many angles as possible by performing target-level modeling, which achieves topic completeness. Thus, TATM strikes a balance between these two conflicting properties without employing any additional information or pre-trained knowledge. Extensive experiments conducted on five real-world datasets demonstrate that the proposed model effectively discovers more focused and complete topics and outperforms the state-of-the-art baselines.
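The abstract does not spell out how E-DMM works, so the following is only a minimal sketch of the standard (non-enhanced) Dirichlet Multinomial Mixture idea it builds on: each short text receives exactly one label, resampled by collapsed Gibbs sampling. It is not the authors' E-DMM or TATM. The function name `dmm_gibbs`, its parameters, and the default hyperparameters are assumptions made for illustration.

```python
import math
import random
from collections import Counter


def dmm_gibbs(docs, num_clusters=8, alpha=0.1, beta=0.1, iters=30, seed=0):
    """Collapsed Gibbs sampling for a plain DMM (assumed stand-in, not E-DMM).

    docs: list of token lists. Returns one cluster label per document.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_counts = [0] * num_clusters                          # documents per cluster
    word_totals = [0] * num_clusters                         # tokens per cluster
    word_counts = [Counter() for _ in range(num_clusters)]   # per-word counts per cluster

    # Random initial assignment: one cluster per document.
    labels = []
    for doc in docs:
        z = rng.randrange(num_clusters)
        labels.append(z)
        doc_counts[z] += 1
        word_totals[z] += len(doc)
        word_counts[z].update(doc)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            # Remove the document from its current cluster.
            z_old = labels[d]
            doc_counts[z_old] -= 1
            word_totals[z_old] -= len(doc)
            word_counts[z_old].subtract(doc)

            # Log of the unnormalized conditional for each cluster.
            freq = Counter(doc)
            log_p = []
            for z in range(num_clusters):
                lp = math.log(doc_counts[z] + alpha)
                for w, c in freq.items():
                    for j in range(c):
                        lp += math.log(word_counts[z][w] + beta + j)
                for i in range(len(doc)):
                    lp -= math.log(word_totals[z] + vocab_size * beta + i)
                log_p.append(lp)

            # Sample a new cluster in a numerically stable way.
            top = max(log_p)
            weights = [math.exp(lp - top) for lp in log_p]
            z_new = rng.choices(range(num_clusters), weights=weights)[0]

            # Add the document to the sampled cluster.
            labels[d] = z_new
            doc_counts[z_new] += 1
            word_totals[z_new] += len(doc)
            word_counts[z_new].update(doc)

    return labels
```

Calling `dmm_gibbs([["apple", "fruit"], ["stock", "market"]], num_clusters=2)` returns a list with one integer label per short text. In this sketch the single-label constraint is what pushes similar words into the same group; the target-level modeling that TATM performs within each aspect is not covered here.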
