首页> 外文会议>International Joint Conference on Artificial Intelligence >Discovering Different Types of Topics: Factored Topic Models
【24h】

Discovering Different Types of Topics: Factored Topic Models

机译:发现不同类型的主题:考虑因素主题模型

获取原文

摘要

In traditional topic models such as LDA, a word is generated by choosing a topic from a collection. However, existing topic models do not identify different types of topics in a document, such as topics that represent the content and topics that represent the sentiment. In this paper, our goal is to discover such different types of topics, if they exist. We represent our model as several parallel topic models (called topic factors), where each word is generated from topics from these factors jointly. Since the latent membership of the word is now a vector, the learning algorithms become challenging. We show that using a variational approximation still allows us to keep the algorithm tractable. Our experiments over several datasets show that our approach consistently outperforms many classic topic models while also discovering fewer, more meaningful, topics.
机译:在诸如LDA之类的传统主题模型中,通过从集合中选择主题来生成一个单词。但是,现有主题模型不会在文档中识别不同类型的主题,例如表示表示情绪的内容和主题的主题。在本文中,我们的目标是发现这种不同类型的主题,如果存在。我们代表我们的模型作为若干并行主题模型(称为主题因子),其中每个单词都是从这些因素的主题中生成的。由于这个词的潜在成员现在是向量,所以学习算法变得具有挑战性。我们表明,使用变形近似仍然可以让我们保持算法易于遗传。我们在多个数据集上的实验表明,我们的方法始终如一地占有许多经典主题模型,同时发现更少,更有意义,主题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号