首页> 外文会议>The 2nd International Conference on Information Science and Engineering >A new re-sampling method for network traffic classification using SML
【24h】

A new re-sampling method for network traffic classification using SML

机译:一种使用SML进行网络流量分类的新重采样方法

获取原文

摘要

The way of internet traffic classification using machine learning has been a hot topic for a long time as it is independent on the packet payloads. However, the problems of classifier biasing towards the majority classes have not been solved effectively now. The uniform sampling is a popular technique to alleviate the data skew in machine learning traffic classification. But the original traffic data distribution would be destroyed by it. A new re-sampling method named tuning sampling for supervised machine learning (SML) is proposed to ease the problem of data skew in internet traffic classification. And it is compared with uniform sampling and stratified sampling methods using C4.5 classification algorithm. Our experimental results indicate that the classifier using tuning sampling gets the accuracy of minority classes are higher than the results of stratified sampling and the overall accuracy is higher than the result of uniform sampling.
机译:长期以来,使用机器学习进行互联网流量分类的方法一直是热门话题,因为它与数据包有效负载无关。但是,分类器偏向多数类的问题目前尚未得到有效解决。统一采样是减轻机器学习流量分类中数据偏斜的一种流行技术。但是原始的交通数据分配将被其破坏。为了缓解互联网流量分类中的数据偏斜问题,提出了一种新的重采样方法,称为有监督的机器学习调整采样(SML)。并与采用C4.5分类算法的均匀抽样和分层抽样方法进行了比较。我们的实验结果表明,使用调整抽样的分类器获得的少数类别的准确度高于分层抽样的结果,总体准确性高于统一抽样的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号