...
首页> 外文期刊>Concurrency and Computation >Detecting spamming activities in twitter based on deep-learning technique
【24h】

Detecting spamming activities in twitter based on deep-learning technique

机译:基于深度学习技术的Twitter垃圾邮件活动检测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Twitter spam has long been a critical but difficult problem to be addressed. So far, researchers have developed a series ofmachine learning–based methods and blacklisting techniques to detect spamming activities on Twitter. According to our investigation, current methods and techniques have achieved the accuracy of around 87%. However, because of the problems of spam drift and information fabrication, these machine learning–based methods cannot efficiently detect spam activities in real-life scenarios. Meanwhile, the blacklisting method also cannot catch up with the variations of spamming activities, as manually inspecting suspicious URLs is extremely timeconsuming. In this paper,weproposed a novel technique based on deep-learning technique to address the above challenges. The syntax of each tweet will be learned throughWordVector and trained bydeep learning.We then constructed a binary classifier todifferentiatespamandregular tweets. In experiments, we collected and labeled a 10-day real tweet dataset as ground truth to evaluate our proposed method.We first went for empirical analysis with a series of comparisons to other methods: (1) performance of different classifiers, (2) other existing text-based methods, and (3) nontext-based detection techniques. According to the experiment results, our proposed method largely outperformed previous methods.We further conducted principle component analysis on typical methods to theoretically justify the outperformance of our method.We extracted all kinds of features via dimensionality reduction. It was found that our featuresweremost distinct among all the detection methods. This well demonstrated the outperformance of our method.
机译:长期以来,Twitter垃圾邮件一直是一个关键但很难解决的问题。到目前为止,研究人员已经开发了一系列基于机器学习的方法和黑名单技术,以检测Twitter上的垃圾邮件活动。根据我们的调查,当前的方法和技术已达到87%左右的准确性。但是,由于垃圾邮件漂移和信息制造的问题,这些基于机器学习的方法无法有效地检测实际场景中的垃圾邮件活动。同时,由于手动检查可疑URL非常耗时,因此黑名单方法也无法跟上垃圾邮件活动的变化。针对以上挑战,本文提出了一种基于深度学习的新技术。每个推文的语法将通过WordVector进行学习,并通过深度学习进行训练。然后,我们构造了一个二元分类器来区分垃圾邮件。在实验中,我们收集并标记了10天的真实推文数据集作为基础事实,以评估我们提出的方法。我们首先进行了实证分析,并与其他方法进行了一系列比较:(1)不同分类器的性能;(2)其他现有的基于文本的方法,以及(3)非基于文本的检测技术。根据实验结果,我们提出的方法大大优于以前的方法。我们进一步对典型方法进行了主成分分析,从理论上证明了该方法的性能。通过降维提取了各种特征。发现我们的特征在所有检测方法中最不同。这很好地证明了我们方法的优越性。

著录项

  • 来源
    《Concurrency and Computation》 |2017年第19期|e4209.1-e4209.11|共11页
  • 作者单位

    School of Information Technology, Deakin University, Victoria 3125, Australia;

    School of Information Technology, Deakin University, Victoria 3125, Australia;

    School of Information Technology, Deakin University, Victoria 3125, Australia;

    School of Information Technology, Deakin University, Victoria 3125, Australia;

    School of Information Technology, Deakin University, Victoria 3125, Australia;

    College of Computer and Information Sciences King Saud University, Riyadh 11543,Saudi Arabia;

    College of Computer and Information Sciences King Saud University, Riyadh 11543,Saudi Arabia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    deep learning; social media security; twitter spam detection;

    机译:深度学习社交媒体安全;Twitter垃圾邮件检测;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号