首页> 外文会议>Annual meeting of the Association for Computational Linguistics >CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech
【24h】

CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

机译:CONAN-通过Nichesourcing进行叙事叙事:应对多语言仇恨言论的多语言数据集

获取原文

摘要

Although there is an unprecedented effort to provide adequate responses in terms of laws and policies to hate content on social media platforms, dealing with hatred online is still a tough problem. Tackling hate speech in the standard way of content deletion or user suspension may be charged with censorship and overblocking. One alternate strategy, that has received little attention so far by the research community, is to actually oppose hate content with counter-narratives (i.e. informed textual responses). In this paper, we describe the creation of the first large-scale, multilingual, expert-based dataset of hate speech/counter-narrative pairs. This dataset has been built with the effort of more than 100 operators from three different NGOs that applied their training and expertise to the task. Together with the collected data we also provide additional annotations about expert demographics, hate and response type, and data augmentation through translation and paraphrasing. Finally, we provide initial experiments to assess the quality of our data.
机译:尽管在法律和政策方面做出了前所未有的努力以对社交媒体平台上的仇恨内容做出足够的回应,但是在网上处理仇恨仍然是一个棘手的问题。以标准的内容删除或用户暂停的方式来处理仇恨言论可能会受到审查和封锁的影响。迄今为止,研究界很少关注的另一种策略是实际上以反叙述反对仇恨内容(即知情的文字回复)。在本文中,我们描述了仇恨言语/反叙事对的第一个大规模,多语言,基于专家的数据集的创建。该数据集是在来自三个不同NGO的100多名操作人员的努力下建立的,他们将他们的培训和专业知识应用于任务。与收集的数据一起,我们还提供有关专家人口统计,仇恨和回应类型以及通过翻译和释义进行数据扩充的其他注释。最后,我们提供了初步实验来评估我们数据的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号