首页> 外文会议>International Conference on Cloud Computing, Data Science and Engineering >Analysis of Ensemble Learning Models for Identifying Spam over Social Networks using Recursive Feature Elimination
【24h】

Analysis of Ensemble Learning Models for Identifying Spam over Social Networks using Recursive Feature Elimination

机译:使用递归特征消除分析用于识别社交网络垃圾邮件的集合学习模型

获取原文

摘要

Social Networking platforms are regarded as being the reliable and valued communication medium for transferring information and communicating, used by the millions throughout the world. Users’ reliance on these social networking sites is growing to seek perspectives, updates, alerts, news etc. While it is evident that the online social networks have become a way for information sharing, at the same instant they have rapidly become a medium for spreading misinformation, rumors, unsolicited messages, propaganda, fake news, and so on. It can indeed be said that a social networking platform consists of the two types of users, namely Spammers and Non-Spammers. Spammers typically spread misinformation or share undesirable content on social networking websites, out of malicious intents. In this work, a model is proposed to identify Spammers in Twitter network. This work is based on the user behavior-based and content-basedfeatures like Hashtags, URLs, Mentions, Replies, and Retweets. In this work, the Recursive Feature Elimination is used along with Support Vector Machine, Random Forest, Logistic Regression, Adaptive Boosting, and XGBoost. For data pre-processing, Weka is used, and implemented the five classifiers mentioned with Recursive Feature Elimination using sklearn in Python. Performance measures such as TP Rate, FP Rate, Precision, F-Measure and Accuracy are used for evaluating the performance of the proposed model
机译:社交网络平台被视为作为传输信息和沟通的可靠和有价值的通信介质,这些媒体由全世界数百万使用。用户依赖这些社交网站正在增长,以寻求观点,更新,警报,新闻等。尽我明很明显,在线社交网络已成为信息共享的方式,同时他们迅速成为传播的媒介误导,谣言,未经请求的消息,宣传,假新闻等。确实可以说,社交网络平台包括两种类型的用户,即垃圾邮件发送者和非垃圾邮件发送者。垃圾邮件发送者通常在恶意意图中传播在社交网站上的错误信息或分享不良内容。在这项工作中,提出了一种模型来识别Twitter网络中的垃圾邮件。这项工作基于基于用户的行为和基于内容的基于HashTags,URL,提到,回复和转发。在这项工作中,递归功能消除以及支持向量机,随机林,逻辑回归,自适应升压和XGBoost。对于数据预处理,使用Weka,并在Python中使用Sklearn实现了使用Sklearn的递归特征消除的五个分类器。 TP速率,FP速率,精度,F测量和精度等性能测量用于评估所提出的模型的性能

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号