首页> 外文会议>Joint international conference on semantic technology >A New Sentiment and Topic Model for Short Texts on Social Media
【24h】

A New Sentiment and Topic Model for Short Texts on Social Media

机译:社交媒体上短文本的新情感和主题模型

获取原文

摘要

Nowadays plenty of user-generated posts, e.g., tweets and sina weibos, are published on social media and the posts imply the public's opinions towards various topics. Joint sentiment/topic models are widely applied in detecting sentiment-aware topics on the lengthy documents. However, the characteristics of posts, i.e., short texts, on social media pose new challenges: (1) context sparsity problem of posts makes traditional sentiment-topic models inapplicable; (2) conventional sentiment-topic models are designed for flat documents without structure information, while publishing users, publishing timeslices and hashtags of posts provide rich structure information for these posts. In this paper, we firstly devise a method to mine potential hashtags, based on explicit hashtags, to further enrich structure information for posts, then we propose a novel Sentiment Topic Model for Posts (STMP) which aggregates posts with the structure information, i.e., timeslices, users and hashtags, to alleviate the context sparsity problem. Experiments on Sentimentl40 and Twitter7 show STMP outperforms previous models both in sentiment classification and sentiment-aware topic extraction.
机译:如今,许多用户生成的帖子(例如,tweet和sina weibos)已在社交媒体上发布,并且这些帖子暗示着公众对各个主题的看法。情感/主题联合模型广泛用于检测冗长文档中的情感主题。但是,社交媒体上帖子(即短文本)的特征带来了新的挑战:(1)帖子的上下文稀疏性问题使传统的情感主题模型不适用; (2)传统的情感主题模型设计用于没有结构信息的平面文档,而发布用户,发布帖子的时间片和主题标签为这些帖子提供了丰富的结构信息。在本文中,我们首先设计一种基于显式标签的挖掘潜在标签的方法,以进一步丰富帖子的结构信息,然后我们提出了一种新颖的帖子情感主题模型(STMP),该帖子将帖子与结构信息进行汇总,即时间片,用户和主题标签,以缓解上下文稀疏性问题。在Sentiment140和Twitter7上进行的实验表明,在情感分类和情感感知主题提取方面,STMP均优于以前的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号