首页> 外文会议>International conference on computational science and technology >GOW-LDA: Applying Term Co-occurrence Graph Representation in LDA Topic Models Improvement
【24h】

GOW-LDA: Applying Term Co-occurrence Graph Representation in LDA Topic Models Improvement

机译:GOW-LDA:在LDA主题模型改进中应用术语共现图表示

获取原文

摘要

In this paper, we demonstrate a novel approach in topic model exploration by applying word co-occurrence graph or graph-of-words (GOW) in order to produce more informative extracted latent topics from a large document corpus. According to the Latent Dirichlet Allocation (LDA) algorithm, it only considers the words occurrence independently via probabilistic distributions. It leads to the failure in term's relationship recognition. Hence in order to overcome this disadvantage of traditional LDA, we propose a novel approach, called GOW-LDA. The GOW-LDA is proposed that combines the GOW graph used in document representation, the frequent subgraph extracting and distribution model of LDA. For evaluation, we compare our proposed model with the traditional one in different classification algorithms. The comparative evaluation is performed in this study by using the standardized datasets. The results generated by the experiments show that the proposed algorithm yields performance respectably.
机译:在本文中,我们通过应用单词共现图或单词图(GOW)展示了一种用于主题模型探索的新颖方法,以便从大型文档语料库中提取出更多信息性的潜在主题。根据潜在狄利克雷分配(LDA)算法,它仅通过概率分布独立地考虑单词出现。这会导致术语的关系识别失败。因此,为了克服传统LDA的这一缺点,我们提出了一种新颖的方法,称为GOW-LDA。提出了将文档表示中使用的GOW图,LDA的频繁子图提取和分布模型相结合的GOW-LDA。为了进行评估,我们在不同的分类算法中将我们提出的模型与传统模型进行了比较。在这项研究中,使用标准化数据集进行了比较评估。实验结果表明,该算法具有良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号