Annual Meeting of the Association for Computational Linguistics

Measuring Sentiment Annotation Complexity of Text



Abstract

The effort required for a human annotator to detect sentiment is not uniform for all texts, irrespective of his/her expertise. We aim to predict a score that quantifies this effort, using linguistic properties of the text. Our proposed metric is called Sentiment Annotation Complexity (SAC). As for training data, since any direct judgment of complexity by a human annotator is fraught with subjectivity, we rely on cognitive evidence from eye-tracking. The sentences in our dataset are labeled with SAC scores derived from eye-fixation duration. Using linguistic features and annotated SACs, we train a regressor that predicts the SAC with a best mean error rate of 22.02% for five-fold cross-validation. We also study the correlation between a human annotator's perception of complexity and a machine's confidence in polarity determination. The merit of our work lies in (a) deciding the sentiment annotation cost in, for example, a crowdsourcing setting, and (b) choosing the right classifier for sentiment prediction.
