首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Webshell Detection Based on Random Forest–Gradient Boosting Decision Tree Algorithm
【24h】

Webshell Detection Based on Random Forest–Gradient Boosting Decision Tree Algorithm

机译:基于随机森林梯度升压决策树算法的WebShell检测

获取原文

摘要

Webshell is a type of web backdoor which can be used by hackers to control web servers remotely. It is true that webshell becomes increasingly hard to detect because of the use of more and more hiding technologies, such decoding and encrypting. However, webshell still can be detected with high accuracy by virtue of machine learning algorithms. In this paper, we proposed a PHP webshell detecting model, the RF-GBDT (Random Forest-Gradient Boosting Decision Tree) model, which is the combination of random forest classifier and GBDT classifier. Besides, we not only used the common statistical features of PHP source files, such as information entropy, index of coincidence and so forth, but also extracted opcode sequence features from PHP source files, including TF-IDF vector and hash vector. Based on the RF-GBDT model and those effective features, the RF-GBDT PHP webshell prediction model shows an excellent performance, achieving the accuracy of 99.169% with false positive rate of 0.682%, shadowing several popular webshell detectors.
机译:WebShell是一种Web BackDoor,可以由黑客使用,以远程控制Web服务器。如果使用越来越多的隐藏技术,这种解码和加密,WebShell越来越难以检测。但是,由于机器学习算法,可以通过高精度来检测WebShell。在本文中,我们提出了一个PHP网晶检测模型,RF-GBDT(随机林梯度升压决策树)模型,是随机林分类器和GBDT分类器的组合。此外,我们不仅使用了PHP源文件的常见统计特征,例如信息熵,巧合索引等,而且还从PHP源文件中提取了Opcode序列特征,包括TF-IDF矢量和哈希矢量。基于RF-GBDT模型和这些有效特征,RF-GBDT PHP WebShell预测模型显示出优异的性能,实现了99.169 %的精度,假阳性率为0.682 %,阴影几个流行的网晶探测器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号