【24h】

Deep-learnt features for Twitter spam detection

机译:Twitter垃圾邮件检测的深度学习功能

获取原文

摘要

Twitter spam has become one of the most critical problems in recent years. Despite the efforts of researchers and security companies, the growing number of spam is not stopping. Machine learning is a very popular technology in network security and is also used for spam detection. An important step of applying machine learning for Twitter spam detection is feature engineering. Existing works mainly use URL based features, meta-data based features and social relation based features to detect spam tweets. All of the above mentioned works require human effort to extract features. More recently, deep learning has shed its light on automated feature engineering in extracting features from text. In this paper, we propose a new feature engineering mechanism based on a deep neural network trained using Bi-LSTM. We name the extracted features “deep-learnt features”. We compare our feature set with word2vec features and statistical features in the experimental evaluation. The results show that machine learning models trained using deep-learnt features can detect Twitter spam more accurately than models trained using word2vec features and statistcal features.
机译:Twitter垃圾邮件已成为近年来最关键的问题之一。尽管研究人员和安全公司的努力,但越来越多的垃圾邮件不会停止。机器学习是网络安全中非常流行的技术,也用于垃圾邮件检测。对Twitter垃圾邮件检测进行机器学习的一个重要步骤是具有工程工程。现有的作品主要使用基于URL的功能,基于元数据的特征和基于社交关系的功能来检测垃圾邮件推文。所有上述作品都需要人们努力提取特征。最近,深入学习在从文本中提取功能中的自动特征工程上阐述了它的光。在本文中,我们提出了一种基于使用Bi-LSTM训练的深神经网络的新特征工程机制。我们命名提取的功能“深度学习功能”。我们将我们的功能设置为在实验评估中使用Word2Vec功能和统计功能。结果表明,使用深度学习功能训练的机器学习模型可以比使用Word2Vec功能和统计特征更准确地检测Twitter垃圾邮件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号