首页> 外文期刊>Journal of software >Web Event Topic Analysis by Topic Feature Clustering and Extended LDA Model
【24h】

Web Event Topic Analysis by Topic Feature Clustering and Extended LDA Model

机译:基于主题特征聚类和扩展LDA模型的Web事件主题分析

获取原文
获取原文并翻译 | 示例
       

摘要

To analyze topics of a large number of web events, we proposed an event topic analysis approach by topic feature clustering and extended LDA (latent dirichlet allocation) model. The extended LDA model is dimension LDA (DLDA) which integrates topic probability of LDA model. We represent an event as a multi-dimensions vector and use DLDA model to select topic feature words in events. We aggregate events which have a common topic by topic feature clustering. In clustering process we use dynamic K-means method to automatically select suitable number of clusters. In this paper a topic term generating rule is proposed to compose topic terms by clustered topic feature words. We accurately detect a common topic from lots of different events and analyze topic terms for events. Experiments on dataset results show that the web event topic analysis approach has high accuracy.
机译:为了分析大量网络事件的主题,我们通过主题特征聚类和扩展的LDA(潜在狄利克雷分配)模型提出了一种事件主题分析方法。扩展的LDA模型是维度LDA(DLDA),它集成了LDA模型的主题概率。我们将事件表示为多维向量,并使用DLDA模型在事件中选择主题特征词。我们按主题特征聚类汇总具有共同主题的事件。在聚类过程中,我们使用动态K均值方法自动选择合适数量的聚类。本文提出了一个主题词生成规则,通过聚类的主题特征词组成主题词。我们从许多不同的事件中准确地检测出一个公共主题,并分析事件的主题词。对数据集结果的实验​​表明,网络事件主题分析方法具有较高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号