...
首页> 外文期刊>Quality Control, Transactions >ULW-DMM: An Effective Topic Modeling Method for Microblog Short Text
【24h】

ULW-DMM: An Effective Topic Modeling Method for Microblog Short Text

机译:ULW-DMM:微博短文本的有效主题建模方法

获取原文
获取原文并翻译 | 示例

摘要

With the popularity of social media, including micro-blog, mining effective information in short texts has become an increasingly important issue. However, due to the sparseness, high dimensionality and large amount of data, mining this information is a very challenging task. In this paper, we propose a method to extend the Dirichlet multinomial mixture (DMM) topic model by combining the user-LDA topic model based on internal data expansion with the potential feature vector representation of words trained on a very large external corpus (we refer to it as ULW-DMM). The experimental results show that the ULW-DMM model produces a relatively large improvement in topic consistency and classification tasks for topic modeling of microblog short texts.
机译:随着包括微博客在内的社交媒体的普及,在短文本中挖掘有效信息已成为越来越重要的问题。但是,由于稀疏,高维和大量数据,挖掘此信息是一项非常具有挑战性的任务。在本文中,我们提出了一种方法,通过将基于内部数据扩展的用户-LDA主题模型与在非常大的外部语料库上训练的单词的潜在特征矢量表示相结合,来扩展狄利克雷多项混合主题(DMM)主题模型作为ULW-DMM)。实验结果表明,ULW-DMM模型在主题一致性和分类任务方面产生了相对较大的改进,用于微博短文本的主题建模。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号