首页> 外文期刊>International journal on Semantic Web and information systems >Semantic Characterization of Tweets Using Topic Models: A Use Case in the Entertainment Domain
【24h】

Semantic Characterization of Tweets Using Topic Models: A Use Case in the Entertainment Domain

机译:使用主题模型对推文进行语义表征:娱乐领域的一个使用案例

获取原文
获取原文并翻译 | 示例
           

摘要

In the entertainment domain users tweet about their expectations and opinions regarding upcoming, current and past experiences, while companies advertise and promote the shows. This characterization, important for customers and companies, goes beyond traditional sentiment analysis where the polarity of the sentiments expressed in opinions is usually identified as positive, negative or neutral. The authors investigate different tweet representation models, including bags of words and probabilistic topic models, to shed light on the semantics of the messages. Their experiments show that topic-based models generated with Latent Dirichlet Allocation (LDA) yield, most of the times, better categorizations when compared to TF-IDF based features, particularly when these models are enriched with natural language features and specific Twitter slang.
机译:在娱乐领域,用户在公司进行广告宣传和宣传的同时,在推特上发布有关他们对未来,当前和过去体验的期望和意见。对于客户和公司而言,这种表征非常重要,它超越了传统的情感分析,在传统的情感分析中,意见表达的情感极性通常被确定为积极,消极或中立。作者研究了不同的tweet表示模型,包括单词袋和概率主题模型,以阐明消息的语义。他们的实验表明,与基于TF-IDF的功能相比,使用潜在狄利克雷分配(LDA)生成的基于主题的模型在大多数情况下会产生更好的分类,特别是当这些模型富含自然语言功能和特定的Twitter lang语时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号