...
首页> 外文期刊>Computational intelligence and neuroscience >Automatic Construction and Global Optimization of a Multisentiment Lexicon
【24h】

Automatic Construction and Global Optimization of a Multisentiment Lexicon

机译:多学期lexicon的自动施工和全局优化

获取原文
获取原文并翻译 | 示例

摘要

Manual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This paper implements statistical training for large-scale Chinese corpus through neural network language model and proposes an automatic method of constructing a multidimensional sentiment lexicon based on constraints of coordinate offset. In order to distinguish the sentiment polarities of those words which may express either positive or negative meanings in different contexts, we further present a sentiment disambiguation algorithm to increase the flexibility of our lexicon. Lastly, we present a global optimization framework that provides a unified way to combine several human-annotated resources for learning our 10-dimensional sentiment lexicon SentiRuc. Experiments show the superior performance of SentiRuc lexicon in category labeling test, intensity labeling test, and sentiment classification tasks. It is worth mentioning that, in intensity label test, SentiRuc outperforms the second place by 21 percent.
机译:手动注释情绪词典成本过多的劳动力和时间,并且也难以准确地量化情绪强度。此外,对一个特定领域的过度强调极大地限制了域情绪词典的适用性(Wang等,2010)。本文通过神经网络语言模型实现大规模中文语料库的统计培训,并提出了一种基于坐标偏移约束构建多维情绪词典的自动方法。为了区分这些词的情感极性,其可以表达在不同背景下的正或负含义的阳性或负含义,我们进一步提高了情绪消歧算法,以提高我们的词典的灵活性。最后,我们展示了一个全局优化框架,提供了一个统一的方式来结合几种人类注释的资源来学习我们的10维情绪Lexicon Sentifuc。实验表明Sentiruc Lexicon在类别标记测试,强度标记测试和情感分类任务中的优越性。值得一提的是,在强度标签测试中,Sentiruc优先于第二名达到21%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号