Measuring Sentiment Annotation Complexity of Text

Abstract

The effort required for a human annotator to detect sentiment is not uniform for all texts, irrespective of his/her expertise. We aim to predict a score that quantifies this effort, using linguistic properties of the text. Our proposed metric is called Sentiment Annotation Complexity (SAC). As for training data, since any direct judgment of complexity by a human annotator is fraught with subjectivity, we rely on cognitive evidence from eye-tracking. The sentences in our dataset are labeled with SAC scores derived from eye-fixation duration. Using linguistic features and annotated SACs, we train a regressor that predicts the SAC with a best mean error rate of 22.02% for five-fold cross-validation. We also study the correlation between a human annotator's perception of complexity and a machine's confidence in polarity determination. The merit of our work lies in (a) deciding the sentiment annotation cost in, for example, a crowdsourcing setting, (b) choosing the right classifier for sentiment prediction.
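The evaluation protocol named in the abstract (a regressor scored by mean error under five-fold cross-validation) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the single feature (word count), the synthetic SAC-like scores, the one-variable least-squares fit standing in for the paper's regressor, and the use of mean absolute percentage error as the "mean error rate" are all hypothetical choices made here for illustration.

```python
import random
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b with a single feature."""
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var if var else 0.0
    return a, my - a * mx


def mean_percentage_error(y_true, y_pred):
    """Mean absolute percentage error (assumed error measure, in percent)."""
    return 100.0 * mean(abs(t - p) / abs(t) for t, p in zip(y_true, y_pred))


def five_fold_cv(xs, ys, k=5, seed=0):
    """Average the held-out error over k folds."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint index sets
    errors = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in idx if i not in held]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        preds = [a * xs[i] + b for i in held_out]
        errors.append(mean_percentage_error([ys[i] for i in held_out], preds))
    return mean(errors)


# Synthetic toy data (hypothetical): longer sentences get higher scores.
rng = random.Random(42)
word_counts = [rng.randint(5, 40) for _ in range(100)]
sac_scores = [1.0 + 0.1 * w + rng.uniform(-0.5, 0.5) for w in word_counts]

cv_error = five_fold_cv(word_counts, sac_scores)
print(f"mean error over 5 folds: {cv_error:.2f}%")
```

Each sentence is held out exactly once, so the reported figure reflects predictions on data the fitted model has not seen.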

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics | 2014 | pp. 36-41 | 6 pages
Authors
Aditya Joshi; Abhijit Mishra; Nivvedan Senthamilselvan; Pushpak Bhattacharyya
Original format
PDF

Similar literature

1. Measuring Algebraic Complexity of Text Understanding Based on Human Concept Learning [J]. Luo X., Zhang J., Li Q., IEEE Transactions on Human-Machine Systems. 2014, No. 5

2. Measuring complexity with multifractals in texts. Translation effects [J]. Ausloos M., Chaos, Solitons and Fractals: Applications in Science and Engineering: An Interdisciplinary Journal of Nonlinear Science. 2012, No. 11

3. Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text [J]. Brett R. South, Danielle Mowery, Ying Suo, Journal of Biomedical Informatics. 2014

4. Measuring Sentiment Annotation Complexity of Text [C]. Aditya Joshi, Abhijit Mishra, Nivvedan Senthamilselvan, Annual Meeting of the Association for Computational Linguistics. 2014

5. Modeling new product success from component measures of product advantage: A model utilizing automated text classification and sentiment analysis [D]. Akerman, Ashley Nolen. 2014

6. A versatile framework for resource-limited sentiment articulation, annotation, and analysis of short texts [O]. Vuk Batanović, Miloš Cvetanović, Boško Nikolić. 2020

7. Associations between the Structure of Urban Landscape and Particulate Matter: A STURLA Case Study in Philadelphia, PA [O]. Lucas Cummings, Justin Stewart, Peleg Kremer. 2021
