首页> 外文会议>Nirma University International Conference on Engineering >Quantifying concept drifting in network traffic using ROC curves from Naive Bayes classifiers
【24h】

Quantifying concept drifting in network traffic using ROC curves from Naive Bayes classifiers

机译:使用Naive Bayes分类器的ROC曲线量化概念漂移在网络流量中

获取原文

摘要

Concept drifting poses a real challenge for network models which depends on statistical heuristics learned from the data stream, for example Anomaly Based Detection/Prevention Systems. These models tend to become inconsistent over a period of time as the underlying data stream like network traffic tends to change and get affected by evolution of concept drift. Change in network traffic pattern is inevitable, it impacts the enterprises which are dynamic in nature especially cloud-centric enterprises. These changes in the network pattern can be of short time period or they can be persistent for longer time duration. Change in network traffic pattern is not always because of malicious activity, changes can be benign and thus impacting the performance of the IDS/IPS model. There is a need to quantify concept drifts and incorporate them in the model. In this paper we have proposed a supervised learning model to quantify the concept drift in the network traffic. The proposed model uses adaptive learning strategies with fixed training window to constantly evolve the model. Classification of data is done by Naive Bayes Classifier. ROC curve generated from Naive Bayes classifiers has been used as a de facto method for identifying concept drift. Classifications have been carried out on entire dataset and also on specific flow attributes like source ip, destination ip, source port, destination port, flags and protocols. In this paper we demonstrate the capabilities of the proposed model to identify drift in the network pattern and also which flow attributes have contributed in concept drifting using ROC curve.
机译:概念漂移对网络模型构成了真正的挑战,这取决于从数据流中学到的统计启发式,例如基于异常的检测/预防系统。由于网络流量等底层数据流往往会受到概念漂移的演变影响,这些模型在一段时间内变得不一致。网络流量模式的变化是不可避免的,它会影响自然界动态的企业,特别是以云为中心的企业。网络模式的这些变化可以是短时间期,或者它们可以持久地持续到更长的时间持续时间。网络流量模式的变化并不总是由于恶意活动,更改可能是良性的,从而影响IDS / IPS模型的性能。需要量化概念漂移并将它们整合在模型中。在本文中,我们提出了一个监督的学习模型来量化网络流量的概念漂移。该建议的模型使用固定训练窗口的自适应学习策略来不断地发展模型。数据分类由Naive Bayes分类器完成。从天真贝叶斯分类器产生的ROC曲线已被用作识别概念漂移的事实上。已经在整个数据集上执行了分类,也在源IP,目标IP,源端口,目标端口,标志和协议等特定流量属性上进行。在本文中,我们展示了所提出的模型来识别网络模式的漂移的能力,以及使用ROC曲线在概念漂移中贡献的流量属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号