首页> 外文期刊>The Journal of Supercomputing >A novel unsupervised classification approach for network anomaly detection by k-Means clustering and ID3 decision tree learning methods
【24h】

A novel unsupervised classification approach for network anomaly detection by k-Means clustering and ID3 decision tree learning methods

机译:一种通过k-Means聚类和ID3决策树学习方法进行网络异常检测的新型无监督分类方法

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents a novel host-based combinatorial method based on k-Means clustering and ID3 decision tree learning algorithms for unsupervised classification of anomalous and normal activities in computer network ARP traffic. The k-Means clustering method is first applied to the normal training instances to partition it into k clusters using Euclidean distance similarity. An ID3 decision tree is constructed on each cluster. Anomaly scores from the k-Means clustering algorithm and decisions of the ID3 decision trees are extracted. A special algorithm is used to combine results of the two algorithms and obtain final anomaly score values. The threshold rule is applied for making the decision on the test instance normality. Experiments are performed on captured network ARP traffic. Some anomaly criteria has been defined and applied to the captured ARP traffic to generate normal training instances. Performance of the proposed approach is evaluated using five defined measures and empirically compared with the performance of individual k-Means clustering and ID3 decision tree classification algorithms and the other proposed approaches based on Markovian chains and stochastic learning automata. Experimental results show that the proposed approach has specificity and positive predictive value of as high as 96 and 98%, respectively.
机译:本文提出了一种基于主机的组合方法,该方法基于k-Means聚类和ID3决策树学习算法,用于计算机网络ARP流量中异常活动和正常活动的无监督分类。首先将k-Means聚类方法应用于正常训练实例,然后使用欧几里得距离相似度将其划分为k个聚类。每个群集上都构建一个ID3决策树。从k均值聚类算法和ID3决策树的决策中提取异常分数。使用特殊算法将两种算法的结果结合起来,以获得最终的异常得分值。阈值规则用于决定测试实例的正常性。对捕获的网络ARP流量进行实验。已经定义了一些异常标准并将其应用于捕获的ARP流量以生成正常的训练实例。使用五个定义的方法评估了该方法的性能,并与单个k-Means聚类和ID3决策树分类算法以及其他基于马尔可夫链和随机学习自动机的方法进行了经验比较。实验结果表明,该方法的特异性和阳性预测值分别高达96%和98%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号