首页> 外文期刊>Journal of Information Science >Intelligent detection of hate speech in Arabic social network:A machine learning approach
【24h】

Intelligent detection of hate speech in Arabic social network:A machine learning approach

机译:阿拉伯社交网络中仇恨言论的智能检测:机器学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

Nowadays, cyber hate speech is increasingly growing, which forms a serious problem worldwide by threatening the cohesion of civil societies. Hate speech relates to using expressions or phrases that are violent, offensive or insulting for a person or a minority of people. In particular, in the Arab region, the number of Arab social media users is growing rapidly, which is accompanied with high increasing rate of cyber hate speech. This drew our attention to aspire healthy online environments that are free of hatred and discrimination. Therefore, this article aims to detect cyber hate speech based on Arabic context over Twitter platform, by applying Natural Language Processing (NLP) techniques, and machine learning methods. The article considers a set of tweets related to racism, journalism, sports orientation, terrorism and Islam. Several types of features and emotions are extracted and arranged in 15 different combinations of data. The processed dataset is experimented using Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT) and Random Forest (RF), in which RF with the feature set of Term Frequency-Inverse Document Frequency (TF-IDF) and profile-related features achieves the best results. Furthermore, a feature importance analysis is conducted based on RF classifier in order to quantify the predictive ability of features in regard to the hate class.
机译:如今,网络仇恨言论越来越多地增长,这通过威胁民间社会的凝聚力来形成全世界的严重问题。讨论言论涉及使用暴力,冒犯或侮辱人或少数人的表达或短语。特别是,在阿拉伯地区,阿拉伯社会媒体用户的数量正在迅速增长,伴随着网络仇恨的高度增加。这使我们注意了渴望没有仇恨和歧视的健康在线环境。因此,本文旨在通过应用自然语言处理(NLP)技术和机器学习方法,根据Twitter平台来检测网络仇恨语音。该文章考虑了一套与种族主义,新闻,体育定向,恐怖主义和伊斯兰教相关的推文。提取几种类型的特征和情绪,并以15个不同的数据组合排列。使用支持向量机(SVM),Naive Bayes(NB),决策树(DT)和随机林(RF)进行实验,其中RF具有术语频率逆文档频率(TF-IDF)的特征集和与个人资料相关的功能达到了最佳结果。此外,基于RF分类器进行特征重要性分析,以便量化讨论阶级的特征的预测能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号