首页> 外文期刊>The international arab journal of information technology >A Sparse Topic Model for Bursty Topic Discovery in Social Networks
【24h】

A Sparse Topic Model for Bursty Topic Discovery in Social Networks

机译:社交网络中突发主题发现的稀疏主题模型

获取原文
获取原文并翻译 | 示例
       

摘要

Bursty topic discovery aims to automatically identify bursty events and continuously keep track of known events. The existing methods focus on the topic model. However, the sparsity of short text brings the challenge to the traditional topic models because the words are too few to learn from the original corpus. To tackle this problem, we propose a Sparse Topic Model (STM) for bursty topic discovery. First, we distinguish the modeling between the bursty topic and the common topic to detect the change of the words in time and discover the bursty words. Second, we introduce "Spike and Slab" prior to decouple the sparsity and smoothness of a distribution. The bursty words are leveraged to achieve automatic discovery of the bursty topics. Finally, to evaluate the effectiveness of our proposed algorithm, we collect Sina weibo dataset to conduct various experiments. Both qualitative and quantitative evaluations demonstrate that the proposed STM algorithm outperforms favorably against several state-of-the-art methods.
机译:Bursty主题发现旨在自动识别突发事件并连续跟踪已知事件。现有方法专注于主题模型。然而,短文本的稀疏对传统主题模型带来了挑战,因为这些词汇太少,无法从原始语料库中学习。为了解决这个问题,我们提出了一个稀疏的主题模型(STM),用于突发主题发现。首先,我们区分突发主题与共同主题之间的建模,以检测单词及时的改变并发现突发单词。其次,我们在将稀疏性和分布的平滑性脱节之前引入“尖峰和平板”。爆发的单词被利用以实现突发主题的自动发现。最后,为了评估我们所提出的算法的有效性,我们收集新浪微博数据集进行各种实验。定性和定量评估都表明,所提出的STM算法优于几种最先进的方法优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号