Deep-learnt features for Twitter spam detection

机译：Twitter垃圾邮件检测的深度学习功能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Twitter spam has become one of the most critical problems in recent years. Despite the efforts of researchers and security companies, the growing number of spam is not stopping. Machine learning is a very popular technology in network security and is also used for spam detection. An important step of applying machine learning for Twitter spam detection is feature engineering. Existing works mainly use URL based features, meta-data based features and social relation based features to detect spam tweets. All of the above mentioned works require human effort to extract features. More recently, deep learning has shed its light on automated feature engineering in extracting features from text. In this paper, we propose a new feature engineering mechanism based on a deep neural network trained using Bi-LSTM. We name the extracted features “deep-learnt features”. We compare our feature set with word2vec features and statistical features in the experimental evaluation. The results show that machine learning models trained using deep-learnt features can detect Twitter spam more accurately than models trained using word2vec features and statistcal features.

机译：Twitter垃圾邮件已成为近年来最关键的问题之一。尽管研究人员和安全公司的努力，但越来越多的垃圾邮件不会停止。机器学习是网络安全中非常流行的技术，也用于垃圾邮件检测。对Twitter垃圾邮件检测进行机器学习的一个重要步骤是具有工程工程。现有的作品主要使用基于URL的功能，基于元数据的特征和基于社交关系的功能来检测垃圾邮件推文。所有上述作品都需要人们努力提取特征。最近，深入学习在从文本中提取功能中的自动特征工程上阐述了它的光。在本文中，我们提出了一种基于使用Bi-LSTM训练的深神经网络的新特征工程机制。我们命名提取的功能“深度学习功能”。我们将我们的功能设置为在实验评估中使用Word2Vec功能和统计功能。结果表明，使用深度学习功能训练的机器学习模型可以比使用Word2Vec功能和统计特征更准确地检测Twitter垃圾邮件。

著录项

来源
《International Symposium on Security and Privacy in Social Networks and Big Data》|2018年|1 v.|共5页
会议地点
作者
Xinbo Ban; Chao Chen; Shigang Liu; Yu Wang; Jun Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Feature extraction; Twitter; Uniform resource locators; Deep learning; Blacklisting; Malware;

机译：特征提取;Twitter;统一资源定位器;深入学习;黑名单;恶意软件;

相似文献

外文文献
中文文献
专利

1. Spam detection on Twitter using a support vector machine and users' features by identifying their interactions [J] . Ahmad Saleh Beyt Sheikh, Rafie Mahnaz, Ghorabie Seyed Mojtaba Multimedia Tools and Applications . 2021,第8期

机译：通过识别其交互，使用支持向量机和用户的功能对Twitter进行垃圾邮件检测
2. Statistical Features-Based Real-Time Detection of Drifted Twitter Spam [J] . IEEE transactions on information forensics and security . 2017,第4期

机译：基于统计功能的Twitter Twitter垃圾邮件实时检测
3. A performance evaluation of deep-learnt features for software vulnerability detection [J] . Xinbo Ban, Shigang Liu, Chao Chen, Concurrency, practice and experience . 2019,第19期

机译：用于软件漏洞检测的深度学习功能的性能评估
4. Deep-learnt features for Twitter spam detection [C] . Xinbo Ban, Chao Chen, Shigang Liu, 2018 International Symposium on Security and Privacy in Social Networks and Big Data . 2018

机译：Twitter垃圾邮件检测的深度学习功能
5. Mitigating spam using network-level features [D] . Ramachandran, Anirudh V. 2011

机译：使用网络级功能缓解垃圾邮件
6. Multi-Rule Based Ensemble Feature Selection Model for Sarcasm Type Detection in Twitter [O] . Karthik Sundararajan, Anandhakumar Palanisamy 2020

机译：Twitter中讽刺类型检测的基于多规则的集合特征选择模型
7. Statistical Features-Based Real-Time Detection of Drifted Twitter Spam [O] . Chen C, Wang Y, Zhang J, 2017

机译：基于统计功能的Twitter Twitter垃圾邮件实时检测

Deep-learnt features for Twitter spam detection

摘要

著录项

相似文献

相关主题

期刊订阅