首页> 外文会议>Workshop on Arabic Natural Language Processing >Hate Speech Detection in Saudi Twittersphere: A Deep Learning Approach
【24h】

Hate Speech Detection in Saudi Twittersphere: A Deep Learning Approach

机译:讨厌沙特·德雷斯克雷斯的讲话检测:深度学习方法

获取原文

摘要

With the rise of hate speech phenomena in Twittersphere,significant research efforts have been undertaken to provide automatic solutions for detecting hate speech,varying from simple machine learning models to more complex deep neural network models. Despite that,research works investigating hate speech problem in Arabic are still limited. This paper,therefore,aims to investigate several neural network models based on Convolutional Neural Network (CNN) and Recurrent Neural Networks (RNN) to detect hate speech in Arabic tweets. It also evaluates the recent language representation model BERT on the task of Arabic hate speech detection. To conduct our experiments,we firstly built a new hate speech dataset that contains 9.316 annotated tweets. Then,we conducted a set of experiments on two datasets to evaluate four models: CNN,GRU,CNN+GRU and BERT. Our experimental results on our dataset and an out-domain dataset show that CNN model gives the best performance with an F1-score of 0.79 and AUROC of 0.89.
机译:随着Twittersphere仇恨语音现象的兴起,已经开展了重大的研究,为检测仇恨言论提供自动解决方案,从简单的机器学习模型变化到更复杂的深度神经网络模型。尽管如此,调查阿拉伯语仇恨讲话问题的研究工作仍然有限。因此,本文旨在研究基于卷积神经网络(CNN)和经常性神经网络(RNN)的几个神经网络模型,以检测阿拉伯语推文中的仇恨语音。它还评估了最近的语言表示模型伯特对阿拉伯语仇恨语音检测的任务。要进行我们的实验,我们首先建立了一个新的仇恨语音数据集,其中包含9.316个注释的推文。然后,我们在两个数据集中进行了一组实验,以评估四种型号:CNN,GRU,CNN + GRU和伯特。我们在我们的数据集和外域数据集上的实验结果表明,CNN模型提供了最佳性能,F1分数为0.79和0.89。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号