首页> 外文期刊>International Journal of Performability Engineering >Applying an Improved Elephant Herding Optimization Algorithm with Spark-based Parallelization to Feature Selection for Intrusion Detection
【24h】

Applying an Improved Elephant Herding Optimization Algorithm with Spark-based Parallelization to Feature Selection for Intrusion Detection

机译:应用了一种改进的大象放牧优化算法与火花的并行化与入侵检测特征选择

获取原文
获取原文并翻译 | 示例
       

摘要

With the growth of the intrusion data scale model, irrelevant or redundant features in high-dimensional intrusion detection data leads to slow processing speed of the intrusion detection algorithm, and the consumption of the algorithm in time and space will increase as the feature dimensions increase. In view of good classification performance of the Elephant Herding Optimization (EHO) algorithm in reducing feature redundancy, this paper introduces the EHO algorithm into feature selection for intrusion detection. Since the basic EHO algorithm tends to fall into a local optimum and lacks strong search ability, the classification performance and dimensional reduction ability of the algorithm are severely limited. Therefore, an Improved Elephant Herding Optimization (IEHO) algorithm is proposed in this paper to search the feature space and find the optimal feature subset, so that the feature number is minimized while the classification performance is maximized. As the scale of intrusion data grows, the large amount of redundant information in the intrusion data will cause the improved algorithm to process slowly. Thus, in this case, the improved algorithm is considered to be parallelized to relieve the pressure of single-machine operation. This paper then proposes a Spark-based distributed parallel IEHO algorithm for intrusion detection, and a feature selection method based on this algorithm for intrusion detection is discussed. The feature selection in a distributed environment can improve the running efficiency of the IEHO algorithm, so as to reduce the running time of the algorithm under the premise of ensuring classification accuracy. As for the experimental validation, both UCI and KDD CUP99 datasets are used to verify the feature selection for intrusion detection. Compared with the classical PSO, MFO, and EHO algorithms, the feature selection by the binary IEHO algorithm is improved by 4.16%, 1.42%, and 0.98%, respectively, and the classification performance is also significantly improved. Compared with the stand-alone version of the IEHO algorithm, the classification efficiency of the parallel IEHO algorithm based on Spark for intrusion feature selection is significantly improved, and the acceleration ratio is increased by two orders of magnitude.
机译:随着入侵数据刻度模型的增长,高维入侵检测数据中的无关或冗余特征导致入侵检测算法的缓慢处理速度,并且随着特征尺寸的增加,算法的消耗量会增加。鉴于大象放牧优化(EHO)算法的良好分类性能降低了特征冗余,将EHO算法介绍了入侵检测的特征选择。由于基本的EHO算法倾向于落入本地最佳且缺乏强烈的搜索能力,因此算法的分类性能和尺寸减小能力严重限制。因此,在本文中提出了一种改进的大象放牧优化(IEHO)算法以搜索特征空间并找到最佳特征子集,从而最小化特征号,而分类性能最大化。随着入侵数据的规模增长,入侵数据中的大量冗余信息将导致改进的算法缓慢处理。因此,在这种情况下,改进的算法被认为是并行化以释放单机操作的压力。然后,本文提出了一种基于火花的分布式并行IEHO算法,用于入侵检测,并且讨论了基于该算法的入侵检测算法的特征选择方法。分布式环境中的特征选择可以提高IEHO算法的运行效率,从而减少确保分类准确性的前提下的算法的运行时间。至于实验验证,UCI和KDD Cup99数据集都用于验证入侵检测的特征选择。与古典PSO,MFO和EHO算法相比,二进制IEHO算法的特征选择分别提高了4.16%,1.42%和0.98%,分类性能也显着提高。与IEHO算法的独立版本相比,基于用于入侵特征选择的Spark的并行IEHO算法的分类效率得到了显着改善,并且加速度增加了两个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号