A Multilingual Evaluation for Online Hate Speech Detection

MICHELE CORAZZA; STEFANO MENINI; ELENA CABRIO; SARA TONELLI; SERENA VILLATA

首页> 外文期刊>ACM Transactions on Internet Technology >A Multilingual Evaluation for Online Hate Speech Detection

【24h】

A Multilingual Evaluation for Online Hate Speech Detection

机译：在线仇恨语音检测的多语言评估

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The increasing popularity of social media platforms such as Twitter and Facebook has led to a rise in the presence of hate and aggressive speech on these platforms. Despite the number of approaches recently proposed in the Natural Language Processing research area for detecting these forms of abusive language, the issue of identifying hate speech at scale is still an unsolved problem. In this article, we propose a robust neural architecture that is shown to perform in a satisfactory way across different languages; namely, English, Italian, and German. We address an extensive analysis of the obtained experimental results over the three languages to gain a better understanding of the contribution of the different components employed in the system, both from the architecture point of view (i.e., Long Short Term Memory, Gated Recurrent Unit, and bidirectional Long Short Term Memory) and from the feature selection point of view (i.e., ngrams, social network-specific features, emotion lexica, emojis, word embeddings). To address such in-depth analysis, we use three freely available datasets for hate speech detection on social media in English, Italian, and German.

机译：社交媒体平台（如Twitter和Facebook）的普及日益普及导致在这些平台上存在仇恨和侵略性的演讲。尽管最近在自然语言处理研究区域提出了用于检测这些形式的滥用语言的方法数量，但在规模上识别仇恨言论的问题仍然是一个未解决的问题。在本文中，我们提出了一种强大的神经结构，其显示以不同语言的令人满意的方式表现;即英语，意大利语和德语。我们解决了对三种语言获得的实验结果的广泛分析，从而更好地了解系统中所采用的不同部件的贡献，无论是从架构的角度（即长的短期内存，门控复发单元，和双向长期内记忆）和特征选择的观点（即，Ngrams，社交网络特定功能，情感Lexica，Emojis，Word Embeddings）。为了解决此类深入分析，我们在英语，意大利语和德语中使用三个可自由的数据集进行仇恨语音检测。

著录项

来源
《ACM Transactions on Internet Technology》 |2020年第2期|共22页
作者
MICHELE CORAZZA; STEFANO MENINI; ELENA CABRIO; SARA TONELLI; SERENA VILLATA;
展开▼
作者单位

Universita di Bologna;

Fondazione Bruno Kessler;

Universite Cote d'Azur;

Fondazione Bruno Kessler;

Universite Cote d'Azur;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Hate speech detection; Social media; Multilingual data; Text classification;

机译：讨厌讲话检测;社交媒体;多语言数据;文本分类;

相似文献

外文文献
中文文献
专利

1. A Multilingual Evaluation for Online Hate Speech Detection [J] . MICHELE CORAZZA, STEFANO MENINI, ELENA CABRIO, ACM Transactions on Internet Technology . 2020,第2期

机译：在线仇恨语音检测的多语言评估
2. Freedom of speech at the intersection of racist speech and online political hate speech [J] . Charlotte Elliott-Harvey European Journal of Communication . 2021,第3期

机译：在种族主义演讲和在线政治仇恨中的交叉口言论自由
3. Quarantining online hate speech: technical and ethical perspectives [J] . Stefanie Ullmann, Marcus Tomalin Ethics and information technology . 2020,第1期

机译：隔离在线仇恨言论：技术和道德观点
4. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech [C] . Yi-Ling Chung, Elizaveta Kuzmenko, Serra Sinem Tekiroglu, Annual meeting of the Association for Computational Linguistics . 2019

机译：CONAN-通过Nichesourcing进行叙事叙事：应对多语言仇恨言论的多语言数据集
5. On the Detection of Hate Speech, Hate Speakers and Polarized Groups in Online Social Media [D] . Warmsley, Dana. 2017

机译：在线社交媒体中仇恨言论，仇恨演说者和两极分化群体的检测
6. Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media [O] . Neeraj Vashistha, Arkaitz Zubiaga 2020

机译：在线多语言仇恨语音检测：用印度和英语社交媒体试验

A Multilingual Evaluation for Online Hate Speech Detection

摘要

著录项

相似文献

相关主题

期刊订阅