
A Benchmark Dataset for Learning to Intervene in Online Hate Speech

Abstract

Countering online hate speech is a critical yet challenging task, one that can be aided by the use of Natural Language Processing (NLP) techniques. Previous research has primarily focused on developing NLP methods to automatically and effectively detect online hate speech, while disregarding the further action needed to calm and discourage individuals from using hate speech in the future. In addition, most existing hate speech datasets treat each post as an isolated instance, ignoring the conversational context. In this paper, we propose a novel task of generative hate speech intervention, where the goal is to automatically generate responses to intervene during online conversations that contain hate speech. As a part of this work, we introduce two fully-labeled large-scale hate speech intervention datasets collected from Gab and Reddit. These datasets provide conversation segments, hate speech labels, as well as intervention responses written by Mechanical Turk workers. In this paper, we also analyze the datasets to understand the common intervention strategies, and explore the performance of common automatic response generation methods on these new datasets to provide a benchmark for future research.
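To make the dataset description concrete, below is a minimal sketch of how a single entry — a conversation segment, per-post hate speech labels, and crowd-written intervention responses — could be represented in Python. The field names and types are illustrative assumptions for this sketch, not the released datasets' actual schema.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class InterventionRecord:
    """Hypothetical representation of one dataset entry."""
    conversation: List[str]           # ordered posts forming the conversation segment
    hate_speech_labels: List[int]     # 1 if the corresponding post is labeled hate speech, else 0
    intervention_responses: List[str] # responses written by crowd workers to intervene

# Illustrative example entry (placeholder text, not real data)
example = InterventionRecord(
    conversation=["first post in the thread ...", "a hateful reply ..."],
    hate_speech_labels=[0, 1],
    intervention_responses=["A counter-speech response written by a worker ..."],
)
```

A structure along these lines would also make it straightforward to pair each conversation with its intervention responses as input-output examples when benchmarking response generation models, as the paper does.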
