Conference: Annual Meeting of the Association for Computational Linguistics

CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Abstract

Although there is an unprecedented effort to respond to hate content on social media platforms through laws and policies, dealing with online hatred remains a hard problem. Tackling hate speech in the standard way, through content deletion or user suspension, may draw charges of censorship and overblocking. One alternative strategy, which has so far received little attention from the research community, is to oppose hate content directly with counter-narratives (i.e., informed textual responses). In this paper, we describe the creation of the first large-scale, multilingual, expert-based dataset of hate speech/counter-narrative pairs. The dataset was built through the effort of more than 100 operators from three different NGOs, who applied their training and expertise to the task. Together with the collected data, we also provide additional annotations covering expert demographics and hate and response type, as well as data augmentation through translation and paraphrasing. Finally, we provide initial experiments to assess the quality of our data.
