首页> 外文会议> >SMOTE Implementation on Phishing Data to Enhance Cybersecurity
【24h】

SMOTE Implementation on Phishing Data to Enhance Cybersecurity

机译:SMOTE实施网上诱骗数据以增强网络安全

获取原文

摘要

Phishing is a form of cybersecurity threat where the criminal tries to gain access to users personal information by infecting their system using malware and viruses. Appearing to come from legitimate sources, it is very easy to fall in the phishing scam. As each phishing data contains features that are consistently different from another, using a predefined set of rules makes a system useless. Data mining techniques can be applied to collected network traffic and build models to predict future attacks. However, since most of the data packets are legitimate, the model tends to produce a bias towards positive results in this imbalanced dataset. In this study, we investigate how prediction accuracy varies in a balanced dataset against an imbalanced one. SMOTE is applied to balance the dataset. XGBoost, Random Forest and Support Vector Machines have been applied on the phishing dataset. Results show much higher accuracy rates with SMOTE application. The highest jump in accuracy has been recorded in XGBoost from 89.87% to 97.17% showing that SMOTE is an effective tool in phishing data monitoring.
机译:网络钓鱼是一种网络安全威胁,犯罪分子试图通过使用恶意软件和病毒感染用户的系统来访问用户的个人信息。似乎来自合法来源,很容易陷入网络钓鱼诈骗。由于每个网络钓鱼数据都包含与其他网络钓鱼功能始终不同的功能,因此使用一组预定义的规则会使系统无用。数据挖掘技术可以应用于收集的网络流量并建立模型以预测未来的攻击。但是,由于大多数数据包都是合法的,因此该模型倾向于在此不平衡数据集中产生对正结果的偏见。在这项研究中,我们调查了平衡数据集中相对于不平衡数据集的预测准确性如何变化。 SMOTE用于平衡数据集。 XGBoost,随机森林和支持向量机已应用于网络钓鱼数据集。结果表明,使用SMOTE应用程序的准确率更高。 XGBoost记录的准确性最高跃升从89.87%上升到97.17%,表明SMOTE是有效的网络钓鱼数据监视工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号