首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Measuring Sentiment Annotation Complexity of Text
【24h】

Measuring Sentiment Annotation Complexity of Text

机译:测量文本的情感注释复杂度

获取原文

摘要

The effort required for a human annota-tor to detect sentiment is not uniform for all texts, irrespective of his/her expertise. We aim to predict a score that quantifies this effort, using linguistic properties of the text. Our proposed metric is called Sentiment Annotation Complexity (SAC). As for training data, since any direct judgment of complexity by a human annota-tor is fraught with subjectivity, we rely on cognitive evidence from eye-tracking. The sentences in our dataset are labeled with SAC scores derived from eye-fixation duration. Using linguistic features and annotated SACs, we train a regressor that predicts the SAC with a best mean error rate of 22.02% for five-fold cross-validation. We also study the correlation between a human annotator's perception of complexity and a machine's confidence in polarity determination. The merit of our work lies in (a) deciding the sentiment annotation cost in, for example, a crowdsourcing setting, (b) choosing the right classifier for sentiment prediction.
机译:人工注释者检测情感所需的努力并非在所有文本中都是统一的,而与他/她的专业知识无关。我们旨在使用文本的语言属性来预测可量化此工作的得分。我们提出的度量标准称为情感注释复杂度(SAC)。至于训练数据,由于人类注释者对复杂性的任何直接判断都充满主观性,因此我们依赖于眼动追踪的认知证据。我们数据集中的句子都标有SAC分数,该分数源自注视时间。使用语言功能和带注释的SAC,我们训练了一个回归器,该回归器对五重交叉验证的最佳平均错误率预测为22.02%。我们还研究了人类注释者对复杂性的感知与机器对极性确定的信心之间的相关性。我们工作的优点在于(a)在众包环境中确定情感注释成本,(b)选择用于情感预测的正确分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号