Intelligent Analysis of Arabic Tweets for Detection of Suspicious Messages

Mohammed A. AlGhamdi; Murtaza Ali Khan

Abstract

With the widespread use of messaging over social networks such as Twitter, Instagram, and Facebook, it is becoming imperative for researchers to devise intelligent systems for data analytics in domains such as business, health, communication, and security. The complex morphological and syntactic structure of Arabic sentences makes them difficult to analyze. This paper presents an intelligent system that analyzes Arabic tweets to detect suspicious messages. We acquired Arabic tweet data from the micro-blogging social network Twitter via the Twitter Streaming Application Programming Interface (API) and saved it in the required file format. The system tokenizes and preprocesses the tweet dataset. The tweets are manually labeled as suspicious (label 1) or not suspicious (label 0). The labeled dataset is used to train a classifier with supervised machine learning algorithms for the detection of suspicious activities. During the testing phase, the system processes unlabeled tweet data and determines whether each tweet belongs to the suspicious or the not-suspicious class. We tested the system using six supervised machine learning algorithms: (1) decision tree, (2) k-nearest neighbors, (3) linear discriminant algorithm, (4) support vector machine (SVM), (5) artificial neural networks (ANN), and (6) long short-term memory networks. A comparative analysis of the six classifiers in terms of accuracy, execution time, and confusion matrices is presented. The ANN has the lowest execution speed. In terms of predicting correct results, the SVM performs best among all the classifiers and yields 86.72% mean accuracy.
The major outcomes of this work are a labeled dataset of Arabic tweets, an intelligent behavior analysis of tweets that uses six machine learning algorithms to detect suspicious messages, a comparative analysis of those six algorithms, and a statistical benchmark that can be used in future studies on the detection of crimes on social media.
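
The stages named in the abstract (preprocessing, training on labeled tweets, testing, and evaluation with a confusion matrix and accuracy) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The cleaning rules, the Jaccard-similarity k-nearest-neighbors classifier, all function names, and the toy tweets are assumptions made for illustration only; the abstract does not describe the features or settings that were actually used.

```python
import re
from collections import Counter

# Assumed cleaning rules (not taken from the paper): drop URLs and @mentions,
# strip Arabic diacritics (U+064B..U+0652) and the tatweel (U+0640),
# then keep runs of Arabic letters as tokens.
NOISE = re.compile(r"https?://\S+|@\w+")
DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")
ARABIC_WORD = re.compile(r"[\u0621-\u064A]+")

def preprocess(tweet):
    """Return the set of Arabic word tokens found in a tweet."""
    text = NOISE.sub(" ", tweet)
    text = DIACRITICS.sub("", text)
    return frozenset(ARABIC_WORD.findall(text))

def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def knn_predict(train, tokens, k=3):
    """Majority vote among the k training tweets most similar to `tokens`."""
    nearest = sorted(train, key=lambda item: jaccard(item[0], tokens),
                     reverse=True)[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

def confusion_matrix(y_true, y_pred):
    """2x2 matrix m[true][predicted] for labels 0 and 1."""
    m = [[0, 0], [0, 0]]
    for t, p in zip(y_true, y_pred):
        m[t][p] += 1
    return m

def accuracy(m):
    """Fraction of predictions that lie on the matrix diagonal."""
    return (m[0][0] + m[1][1]) / sum(map(sum, m))

# Invented toy data: label 1 = suspicious, label 0 = not suspicious.
labeled = [
    ("هجوم خطير الليلة", 1),
    ("تهديد خطير جدا", 1),
    ("الطقس جميل اليوم", 0),
    ("فطور جميل اليوم", 0),
]
train = [(preprocess(text), label) for text, label in labeled]

test = [
    ("@user هجوم خطير https://t.co/x", 1),
    ("الطقس جميل", 0),
    ("ج\u064Eميل اليوم", 0),  # contains a diacritic (fatha) that is stripped
]
y_true = [label for _, label in test]
y_pred = [knn_predict(train, preprocess(text)) for text, _ in test]

matrix = confusion_matrix(y_true, y_pred)
print("confusion matrix:", matrix)
print("accuracy:", accuracy(matrix))
```

A similarity-based classifier was chosen here only because it fits in a few lines of standard-library code. A real system would more likely use a machine learning library and a much larger labeled dataset.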


Bibliographic details

Source
Arabian Journal for Science and Engineering. Section A, Sciences | 2020, Issue 8 | pp. 6021-6032 | 12 pages
Authors
Mohammed A. AlGhamdi; Murtaza Ali Khan
Author affiliations

College of Computer and Information Systems, Umm Al-Qura University, Mecca, Saudi Arabia

College of Computer and Information Systems, Umm Al-Qura University, Mecca, Saudi Arabia

Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language: English
Keywords
Twitter; Arabic tweets; Social media; Machine learning; Intelligent systems; Supervised learning
Added to database: 2022-08-18 21:04:44
