首页> 外文会议>Annual IEEE India Conference >Machine Learning Based Internet Traffic Recognition with Statistical Approach
【24h】

Machine Learning Based Internet Traffic Recognition with Statistical Approach

机译:基于机器学习的互联网流量识别与统计方法

获取原文

摘要

The researchers have started looking for Internet traffic recognition techniques that are independent of 'well known' TCP or UDP port numbers, or interpreting the contents of packet payloads. Newer approaches classify traffic by recognizing statistical patterns in externally observable attributes of the traffic (such as typical packet lengths and inter-arrival times). The main goal is to cluster or classify the Internet traffic flows into groups that have identical statistical properties. The need to deal with Traffic patterns, large datasets and Multidimensional spaces of flow and packet attributes is one of the reasons for the introduction of Machine Learning (ML) techniques in this field. ML techniques are subset of Artificial Intelligence used for traffic recognition. Further, there are four types of Machine Learning, i.e. Classification (Supervised learning), clustering (Un-Supervised learning), Numeric prediction and Association. In this research paper IP traffic recognition through classification process is implemented. Different researchers are calling this process as IP traffic Recognition, IP traffic Identification, and sometimes IP traffic classification. Here Real time internet traffic has been captured using packet capturing tool and datasets has been developed. Also few standard datasets have been used in this research work. Then using standard attribute selection algorithms, a reduced statistical feature dataset has been developed. After that, Six ML algorithms AdaboostM1, C4.5, Random Forest tree, MLP, RBF and SVM with Polykernel function classifiers are used for IP traffic classification. This implementation and analysis shows that Tree based algorithms are effective ML techniques for Internet traffic classification with accuracy up to of 99.7616 %.
机译:研究人员已经开始寻找独立于“众所周知的”TCP或UDP端口号的互联网业务识别技术,或解释数据包有效载荷的内容。较新方法通过识别流量的外部可观察属性(例如典型分组长度和到达时间)中的外部可观察属性的统计模式来分类流量。主要目标是群集或将Internet流量流入具有相同统计属性的组。需要处理流量模式,大型数据集和流量和数据包属性的多维空间是在该字段中引入机器学习(ML)技术的原因之一。 ML技术是用于交通识别的人工智能的子集。此外,有四种类型的机器学习,即分类(监督学习),聚类(未经监督的学习),数字预测和关联。在本研究中,实施了通过分类过程的IP流量识别。不同的研究人员将此过程称为IP流量识别,IP流量识别,有时IP流量分类。这里使用数据包捕获工具捕获了实时互联网流量,并且已经开发了数据集。在本研究工作中也使用了很少的标准数据集。然后使用标准属性选择算法,已经开发了一个减少的统计特征数据集。之后,使用具有Polykernel函数分类器的六毫升算法Adaboostm1,C4.5,随机林树,MLP,RBF和SVM用于IP流量分类。该实施和分析表明,基于树的算法是互联网流量分类的有效ML技术,精度高达99.7616%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号