【24h】

An Effective Neural Network Phishing Detection Model Based on Optimal Feature Selection

机译:基于最优特征选择的有效神经网络网络钓鱼检测模型

获取原文
获取原文并翻译 | 示例

摘要

As a common means to obtain user privacy information, phishing poses a big threat to people's daily network environment. The detection and prevention the threats of phishing websites are of importance. Due to the active learning ability from large-scale datasets, neural network is an important heuristic machine learning method in phishing websites detection and prevention. However, during the process of data training, some useless features may cause the machine learning method to over-fitting which will result in the training model not being able to precisely predict and detect the phishing websites. Aiming at this problem, based on the optimal feature selection method, this paper proposes an effective neural network detection model (OFS-NN) to detect the pushing websites. Under this model, an optimal feature selection algorithm that adapts to the sensitive features of phishing URLs (Uniform Resource Locators) is firstly proposed. Based on the calculation of the effective value of each feature, this algorithm sets a threshold to eliminate some useless features and selects an optimal feature set suitable for detecting phishing websites. Then, the selected optimal feature set is trained by the neural network to construct an optimal classifier to classify and predict the pushing websites. The experimental results have shown that the proposed OFS-NN provides an effective solution for predicting and detecting phishing websites. It has little false positive rate and strong generalization ability. In addition, the optimal feature selection algorithm improves the performance in the sample training process of machine learning methods.
机译:网络钓鱼作为获取用户隐私信息的常用手段,对人们的日常网络环境构成了巨大威胁。检测和预防网络钓鱼网站的威胁非常重要。由于能够从大规模数据集中进行主动学习,因此神经网络是网络钓鱼网站检测和预防中的一种重要的启发式机器学习方法。但是,在数据训练过程中,一些无用的功能可能会导致机器学习方法过度拟合,从而导致训练模型无法准确地预测和检测网络钓鱼网站。针对该问题,基于最优特征选择方法,提出了一种有效的神经网络检测模型(OFS-NN),用于检测推送网站。在此模型下,首先提出了一种适合网络钓鱼URL(统一资源定位符)敏感特征的最优特征选择算法。基于每个特征的有效值的计算,该算法设置了一个阈值以消除一些无用的特征,并选择了适合检测网络钓鱼网站的最佳特征集。然后,所选择的最优特征集被神经网络训练以构造最优分类器以对推送网站进行分类和预测。实验结果表明,所提出的OFS-NN为网络钓鱼网站的预测和检测提供了有效的解决方案。假阳性率低,泛化能力强。另外,最优特征选择算法提高了机器学习方法的样本训练过程的性​​能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号