首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Best-Worst Scaling More Reliable than Rating Scales: A Case Study on Sentiment Intensity Annotation
【24h】

Best-Worst Scaling More Reliable than Rating Scales: A Case Study on Sentiment Intensity Annotation

机译:最差的规模比评级量表更可靠:以情感强度注释为例

获取原文

摘要

Rating scales are a widely used method for data annotation; however, they present several challenges, such as difficulty in maintaining inter- and intra-annotator consistency. Best-worst scaling (BWS) is an alternative method of annotation that is claimed to produce high-quality annotations while keeping the required number of annotations similar to that of rating scales. However, the veracity of this claim has never been systematically established. Here for the first time, we set up an experiment that directly compares the rating scale method with BWS. We show that with the same total number of annotations, BWS produces significantly more reliable results than the rating scale.
机译:评定量表是一种广泛使用的数据注释方法。然而,它们提出了一些挑战,例如难以保持注释者之间和注释者内部的一致性。最差标定(BWS)是注解的一种替代方法,据称可产生高质量注解,同时保持所需的注解数量与等级量表相似。但是,此主张的准确性从未得到系统地确定。在这里,我们首次建立了一个直接将评分量表方法与BWS进行比较的实验。我们显示,在批注总数相同的情况下,BWS所产生的结果要比评级量表可靠得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号