Detecting spamming activities in twitter based on deep-learning technique

TingminWu; ShengWen; Shigang Liu; Jun Zhang; Yang Xiang; Majed Alrubaian; MohammadMehedi Hassan

首页> 外文期刊>Concurrency and Computation >Detecting spamming activities in twitter based on deep-learning technique

【24h】

Detecting spamming activities in twitter based on deep-learning technique

机译：基于深度学习技术的Twitter垃圾邮件活动检测

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Twitter spam has long been a critical but difficult problem to be addressed. So far, researchers have developed a series ofmachine learning–based methods and blacklisting techniques to detect spamming activities on Twitter. According to our investigation, current methods and techniques have achieved the accuracy of around 87%. However, because of the problems of spam drift and information fabrication, these machine learning–based methods cannot efficiently detect spam activities in real-life scenarios. Meanwhile, the blacklisting method also cannot catch up with the variations of spamming activities, as manually inspecting suspicious URLs is extremely timeconsuming. In this paper,weproposed a novel technique based on deep-learning technique to address the above challenges. The syntax of each tweet will be learned throughWordVector and trained bydeep learning.We then constructed a binary classifier todifferentiatespamandregular tweets. In experiments, we collected and labeled a 10-day real tweet dataset as ground truth to evaluate our proposed method.We first went for empirical analysis with a series of comparisons to other methods: (1) performance of different classifiers, (2) other existing text-based methods, and (3) nontext-based detection techniques. According to the experiment results, our proposed method largely outperformed previous methods.We further conducted principle component analysis on typical methods to theoretically justify the outperformance of our method.We extracted all kinds of features via dimensionality reduction. It was found that our featuresweremost distinct among all the detection methods. This well demonstrated the outperformance of our method.

机译：长期以来，Twitter垃圾邮件一直是一个关键但很难解决的问题。到目前为止，研究人员已经开发了一系列基于机器学习的方法和黑名单技术，以检测Twitter上的垃圾邮件活动。根据我们的调查，当前的方法和技术已达到87％左右的准确性。但是，由于垃圾邮件漂移和信息制造的问题，这些基于机器学习的方法无法有效地检测实际场景中的垃圾邮件活动。同时，由于手动检查可疑URL非常耗时，因此黑名单方法也无法跟上垃圾邮件活动的变化。针对以上挑战，本文提出了一种基于深度学习的新技术。每个推文的语法将通过WordVector进行学习，并通过深度学习进行训练。然后，我们构造了一个二元分类器来区分垃圾邮件。在实验中，我们收集并标记了10天的真实推文数据集作为基础事实，以评估我们提出的方法。我们首先进行了实证分析，并与其他方法进行了一系列比较：（1）不同分类器的性能;（2）其他现有的基于文本的方法，以及（3）非基于文本的检测技术。根据实验结果，我们提出的方法大大优于以前的方法。我们进一步对典型方法进行了主成分分析，从理论上证明了该方法的性能。通过降维提取了各种特征。发现我们的特征在所有检测方法中最不同。这很好地证明了我们方法的优越性。

著录项

来源
《Concurrency and Computation》 |2017年第19期|e4209.1-e4209.11|共11页
作者
TingminWu; ShengWen; Shigang Liu; Jun Zhang; Yang Xiang; Majed Alrubaian; MohammadMehedi Hassan;
展开▼
作者单位

School of Information Technology, Deakin University, Victoria 3125, Australia;

School of Information Technology, Deakin University, Victoria 3125, Australia;

School of Information Technology, Deakin University, Victoria 3125, Australia;

School of Information Technology, Deakin University, Victoria 3125, Australia;

School of Information Technology, Deakin University, Victoria 3125, Australia;

College of Computer and Information Sciences King Saud University, Riyadh 11543,Saudi Arabia;

College of Computer and Information Sciences King Saud University, Riyadh 11543,Saudi Arabia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
deep learning; social media security; twitter spam detection;

机译：深度学习社交媒体安全;Twitter垃圾邮件检测;

相似文献

外文文献
中文文献
专利

1. Detecting and Characterizing Arab Spammers Campaigns in Twitter [J] . Reem Alharthi, Areej Alhothali, Kawthar Moria Procedia Computer Science . 2019,第1期

机译：在Twitter中检测和表征阿拉伯垃圾邮件发送者活动
2. Feature engineering for detecting spammers on Twitter: Modelling and analysis [J] . Wafa Herzallah, Hossam Faris, Omar Adwan Journal of Information Science . 2018,第2期

机译：用于在Twitter上检测垃圾邮件发送者的功能工程：建模和分析
3. Detecting Streaming of Twitter Spam Using Hybrid Method [J] . Murugan N. Senthil, Devi G. Usha Wireless personal communications: An Internaional Journal . 2018,第2期

机译：使用混合方法检测Twitter垃圾邮件流
4. A Multi-classifier Framework for Detecting Spam and Fake Spam Messages in Twitter [C] . R. Jeberson Retna Raj, Senduru Srinivasulu, Aldrin Ashutosh International Conference on Communication Systems and Network Technologies . 2020

机译：用于在Twitter中检测垃圾邮件和虚假垃圾邮件的多分类器框架
5. Detecting Organizational Accounts from Twitter Based on Network and Behavioral Factors [D] . Gore, Chinmay Chandrashekhar. 2017

机译：基于网络和行为因素从Twitter检测组织帐户
6. Optimization of a Deep-Learning Method Based on the Classification of Images Generated by Parameterized Deep Snap a Novel Molecular-Image-Input Technique for Quantitative Structure–Activity Relationship (QSAR) Analysis [O] . Yasunari Matsuzaka, Yoshihiro Uesawa 2019

机译：基于参数化深度捕捉生成图像分类的深度学习方法的优化-一种定量构效关系（QSAR）分析的分子图像输入新技术
7. Techniques to Detect Spammers in Twitter- A Survey [O] . Monika Verma, Ph. D. Scholar 2014

机译：在Twitter中检测垃圾邮件发送者的技术-一项调查

Detecting spamming activities in twitter based on deep-learning technique

摘要

著录项

相似文献

相关主题

期刊订阅