...
首页> 外文期刊>Knowledge-Based Systems >Meta-level sentiment models for big social data analysis
【24h】

Meta-level sentiment models for big social data analysis

机译:用于大型社交数据分析的元级情感模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

People react to events, topics and entities by expressing their personal opinions and emotions. These reactions can correspond to a wide range of intensities, from very mild to strong. An adequate processing and understanding of these expressions has been the subject of research in several fields, such as business and politics. In this context, Twitter sentiment analysis, which is the task of automatically identifying and extracting subjective information from tweets, has received increasing attention from the Web mining community. Twitter provides an extremely valuable insight into human opinions, as well as new challenging Big Data problems. These problems include the processing of massive volumes of streaming data, as well as the automatic identification of human expressiveness within short text messages. In that area, several methods and lexical resources have been proposed in order to extract sentiment indicators from natural language texts at both syntactic and semantic levels. These approaches address different dimensions of opinions, such as subjectivity, polarity, intensity and emotion. This article is the first study of how these resources, which are focused on different sentiment scopes, complement each other. With this purpose we identify scenarios in which some of these resources are more useful than others. Furthermore, we propose a novel approach for sentiment classification based on meta-level features. This supervised approach boosts existing sentiment classification of subjectivity and polarity detection on Twitter. Our results show that the combination of meta-level features provides significant improvements in performance. However, we observe that there are important differences that rely on the type of lexical resource, the dataset used to build the model, and the learning strategy. Experimental results indicate that manually generated lexicons are focused on emotional words, being very useful for polarity prediction. On the other hand, lexicons generated with automatic methods include neutral words, introducing noise in the detection of subjectivity. Our findings indicate that polarity and subjectivity prediction are different dimensions of the same problem, but they need to be addressed using different subspace features. Lexicon-based approaches are recommendable for polarity, and stylistic part-of-speech based approaches are meaningful for subjectivity. With this research we offer a more global insight of the resource components for the complex task of classifying human emotion and opinion.
机译:人们通过表达自己的观点和情感来对事件,主题和实体做出反应。这些反应可以对应很宽的强度,从非常温和到强烈。对这些表达方式的充分处理和理解已成为商业和政治等多个领域研究的主题。在这种情况下,Twitter情感分析是自动识别和从推文中提取主观信息的任务,因此越来越受到Web挖掘社区的关注。 Twitter提供了非常有价值的洞察力,可洞悉人们的观点以及新的具有挑战性的大数据问题。这些问题包括处理大量流数据,以及在短文本消息中自动识别人类表现力。在该领域,已经提出了几种方法和词汇资源,以便从自然语言文本的句法和语义两个层面上提取情感指标。这些方法解决了意见的不同方面,例如主观性,极性,强度和情感。本文是对这些针对不同情感范围的资源如何相互补充的首次研究。为此,我们确定了其中某些资源比其他资源更有用的方案。此外,我们提出了一种基于元层次特征的情感分类新方法。这种受监督的方法促进了Twitter上对主观性和极性检测的现有情感分类。我们的结果表明,元级功能的组合可显着提高性能。但是,我们观察到,依赖于词汇资源的类型,用于构建模型的数据集和学习策略存在重要的区别。实验结果表明,人工生成的词典集中在情感词上,对于极性预测非常有用。另一方面,用自动方法生成的词典包括中性词,从而在主观性检测中引入了噪声。我们的发现表明,极性和主观性预测是同一问题的不同维度,但需要使用不同的子空间特征来解决。基于词汇的方法对于极性是可取的,而基于风格词性的方法对于主观性是有意义的。通过这项研究,我们可以对资源成分进行更全面的了解,以完成对人类情感和观点进行分类的复杂任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号