首页> 外文会议>Workshop on Online Abuse and Harms >Six Attributes of Unhealthy Conversations
【24h】

Six Attributes of Unhealthy Conversations

机译:不健康对话的六个属性

获取原文

摘要

We present a new dataset of approximately 44000 comments labeled by crowdworkers. Each comment is labelled as either 'healthy' or 'unhealthy', in addition to binary labels for the presence of six potentially 'unhealthy' sub-attributes: (1) hostile; (2) antagonistic, insulting, provocative or trolling: (3) dismissive; (4) condescending or patronising; (5) sarcastic: and/or (6) an unfair generalisation. Each label also has an associated confidence score. We argue that there is a need for datasets which enable research based on a broad notion of 'unhealthy online conversation'. We build this typology to encompass a substantial proportion of the individual comments which contribute to unhealthy online conversation. For some of these attributes, this is the first publicly available dataset of this scale. We explore the quality of the dataset, present some summary statistics and initial models to illustrate the utility of this data, and highlight limitations and directions for further research.
机译:我们展示了一个由人群公司标记的大约44000点评论的新数据集。除了在存在六个可能的“不健康”子属性的二进制标签之外,每个评论都标记为“健康”或“不健康”:(1)敌对; (2)对抗,侮辱,挑衅或拖钓:(3)解除; (4)居高临下或光顾; (5)讽刺:和/或(6)不公平的泛化。每个标签也有相关的信心得分。我们认为,需要基于对“不健康的在线谈话”的广泛概念来实现研究的数据集。我们建立这个类型的类型,以包括一个有助于不健康的在线谈话的个人评论。对于其中一些属性,这是此比例的第一个公开的数据集。我们探讨了数据集的质量,提供了一些摘要统计和初始模型,以说明此数据的实用性,并突出显示进一步研究的限制和方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号