首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Inter and Intra Topic Structure Learning with Word Embeddings
【24h】

Inter and Intra Topic Structure Learning with Word Embeddings

机译:与Word Embeddings学习的间际和内部主题结构

获取原文
           

摘要

One important task of topic modeling for text analysis is interpretability. By discovering structured topics one is able to yield improved interpretability as well as modeling accuracy. In this paper, we propose a novel topic model with a deep structure that explores both inter-topic and intra-topic structures informed by word embeddings. Specifically, our model discovers inter topic structures in the form of topic hierarchies and discovers intra topic structures in the form of sub-topics, each of which is informed by word embeddings and captures a fine-grained thematic aspect of a normal topic. Extensive experiments demonstrate that our model achieves the state-of-the-art performance in terms of perplexity, document classification, and topic quality. Moreover, with topic hierarchies and sub-topics, the topics discovered in our model are more interpretable, providing an illuminating means to understand text data.
机译:文本分析建模的一个重要任务是解释性。通过发现结构化主题,能够产生改进的可解释性以及建模精度。在本文中,我们提出了一种新颖的主题模型,具有深度结构,探讨了Word Embeddings通知的主题间和主题内部结构。具体而言,我们的模型以主题层次结构的形式发现主题结构,并以子主题的形式发现帧内主题结构,每个主题由Word Embeddings通知并捕获正常主题的细粒度主题方面。广泛的实验表明,我们的模型在困惑,文档分类和主题质量方面实现了最先进的性能。此外,对于主题层次结构和子主题,我们模型中发现的主题更具可解释,提供了理解文本数据的照明手段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号