...
首页> 外文期刊>Journal of supercomputing >Network anomaly detection based on selective ensemble algorithm
【24h】

Network anomaly detection based on selective ensemble algorithm

机译:基于选择性集合算法的网络异常检测

获取原文
获取原文并翻译 | 示例
           

摘要

In order to reduce the loss of information of the majority class samples in the resampling process, combining the distribution of class samples and the characteristics of ensemble learning algorithm, in this paper, a two-level selective ensemble learning algorithm for imbalanced datasets is proposed. Firstly, the algorithm under-samples the majority class samples and constructs multiple training subsets. The training process will generate multiple base classifiers using AdaBoost algorithm, then select some base classifiers according to maximum correlation and minimum redundancy criteria, and form sub-classifiers according to weighted integration. Then, generate multiple sub-classifiers for multiple training subsets, and then, select some sub-classifiers according to maximum correlation and minimum redundancy criteria. Then, the weights of the selected sub-classifiers are calculated by F-means or G-means, and the ensemble classifier is obtained by weighted voting. Finally, the improved algorithm for imbalanced dataset is applied to the network anomaly detection. The experimental results on UCI datasets show that this method can improve the classification performance to a certain extent, especially for imbalanced datasets. Finally, the algorithm is applied to network anomaly detection for Internet of Things. From the simulation data of KDDCUP99 dataset, we can see that TLSE-ID algorithm has a small missing report rate and high precision.
机译:为了减少重采样过程中多数类样本的信息丢失,在本文中组合了类样本的分布和集合学习算法的特性,提出了一种用于不平衡数据集的两级选择性集合学习算法。首先,该算法在大多数类样本中进行了样本并构建多个训练子集。训练过程将使用Adaboost算法生成多个基本分类器,然后根据最大相关和最小冗余标准选择一些基本分类器,并根据加权集成来形成子分类器。然后,为多个训练子集生成多个子分类器,然后根据最大相关和最小冗余标准选择一些子分类器。然后,通过F-il il或g-il算法计算所选子分类器的权重,并且通过加权投票获得集合分类器。最后,应用于网络异常检测的改进的不平衡数据集算法。 UCI数据集的实验结果表明,该方法可以在一定程度上提高分类性能,尤其是对于不平衡数据集。最后,将算法应用于网络异常检测以获得物联网。从KDDCup99数据集的模拟数据,我们可以看到TLSE-ID算法具有小的报告速率和高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号