...
首页> 外文期刊>ACM Transactions on Internet Technology >A Multilingual Evaluation for Online Hate Speech Detection
【24h】

A Multilingual Evaluation for Online Hate Speech Detection

机译:在线仇恨语音检测的多语言评估

获取原文
获取原文并翻译 | 示例
           

摘要

The increasing popularity of social media platforms such as Twitter and Facebook has led to a rise in the presence of hate and aggressive speech on these platforms. Despite the number of approaches recently proposed in the Natural Language Processing research area for detecting these forms of abusive language, the issue of identifying hate speech at scale is still an unsolved problem. In this article, we propose a robust neural architecture that is shown to perform in a satisfactory way across different languages; namely, English, Italian, and German. We address an extensive analysis of the obtained experimental results over the three languages to gain a better understanding of the contribution of the different components employed in the system, both from the architecture point of view (i.e., Long Short Term Memory, Gated Recurrent Unit, and bidirectional Long Short Term Memory) and from the feature selection point of view (i.e., ngrams, social network-specific features, emotion lexica, emojis, word embeddings). To address such in-depth analysis, we use three freely available datasets for hate speech detection on social media in English, Italian, and German.
机译:社交媒体平台(如Twitter和Facebook)的普及日益普及导致在这些平台上存在仇恨和侵略性的演讲。尽管最近在自然语言处理研究区域提出了用于检测这些形式的滥用语言的方法数量,但在规模上识别仇恨言论的问题仍然是一个未解决的问题。在本文中,我们提出了一种强大的神经结构,其显示以不同语言的令人满意的方式表现;即英语,意大利语和德语。我们解决了对三种语言获得的实验结果的广泛分析,从而更好地了解系统中所采用的不同部件的贡献,无论是从架构的角度(即长的短期内存,门控复发单元,和双向长期内记忆)和特征选择的观点(即,Ngrams,社交网络特定功能,情感Lexica,Emojis,Word Embeddings)。为了解决此类深入分析,我们在英语,意大利语和德语中使用三个可自由的数据集进行仇恨语音检测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号