首页> 外文会议>10th workshop on computational approaches to subjectivity, sentiment and social media analysis >Exploring Fine-Tuned Embeddings that Model Intensifies for Emotion Analysis
【24h】

Exploring Fine-Tuned Embeddings that Model Intensifies for Emotion Analysis

机译:探索模型增强的情绪分析嵌入

获取原文
获取原文并翻译 | 示例

摘要

Adjective phrases like "a little bit surprised", "completely shocked", or "not stunned at all" are not handled properly by currently published state-of-the-art emotion classification and intensity prediction systems which use predominantly non-contextualized word embeddings as input. Based on this finding, we analyze differences between embeddings used by these systems in regard to their capability of handling such cases. Furthermore, we argue that intensifiers in context of emotion words need special treatment, as is established for sentiment polarity classification, but not for more fine-grained emotion prediction. To resolve this issue, we analyze different aspects of a post-processing pipeline which enriches the word representations of such phrases. This includes expansion of semantic spaces at the phrase level and sub-word level followed by retrofitting to emotion lexica. We evaluate the impact of these steps with A La Carte and Bag-of-Substrings extensions based on pretrained GloVe, Word2vec, and fastText embeddings against a crowd-sourced corpus of intensity annotations for tweets containing our focus phrases. We show that the fastText-based models do not gain from handling these specific phrases under inspection. For Word2vec embeddings, we show that our post-processing pipeline improves the results by up to 8% on a novel dataset densely populated with intensifiers.
机译:当前发布的最新情感分类和强度预测系统无法正确处理形容词短语,例如“有点惊讶”,“完全震惊”或“完全不震惊”主要是非上下文的词嵌入作为输入。基于此发现,我们分析了这些系统使用的嵌入之间在处理此类案件的能力方面的差异。此外,我们认为情感词上下文中的增强器需要特殊处理,这是为情感极性分类而建立的,但对于更细粒度的情感预测则不需要。为了解决此问题,我们分析了后处理管道的不同方面,该管道丰富了此类短语的单词表示形式。这包括在短语级别和子单词级别扩展语义空间,然后改编为情感词典。我们使用基于经过预先训练的GloVe,Word2vec和fastText嵌入的点菜和袋子字符串扩展,对包含我们关注焦点的推文的众包强度注释语料库,评估了这些步骤的影响。我们表明,基于fastText的模型无法从检查中的这些特定短语中受益。对于Word2vec嵌入,我们显示了在密集填充了增强器的新数据集上,后处理管道将结果提高了8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号