首页> 外文期刊>Computer Communications >Imbalanced traffic identification using an imbalanced data gravitation-based classification model
【24h】

Imbalanced traffic identification using an imbalanced data gravitation-based classification model

机译:使用基于不平衡数据引力的分类模型进行不平衡交通识别

获取原文
获取原文并翻译 | 示例
           

摘要

As a basic method for monitoring the activities of Internet applications, traffic identification is very important for Internet management and security. Internet traffic data naturally exhibits imbalanced distributions, but this problem is rarely considered by the research community. Data gravitation-based classification (DGC) is a new classification model for handling imbalance data sets and we proposed an imbalanced DGC (IDGC) model in our previous study. In the present study, we developed an IDGC-based model to resolve imbalanced Internet traffic identification problems. First, we constructed six imbalanced traffic data sets from three original traffic data sets and we then extracted their early stage features according to the packet sizes. In identification experiments, we compared the performance of six standard algorithms, including DGC and four imbalanced algorithms with IDGC. The experimental results demonstrated that the standard classification models could achieve high accuracy with imbalanced traffic data, but their imbalanced performance was not as good, and their generalizability was also a problem. By contrast, IDGC performed very well and in a stable manner in terms of imbalanced performance measures, thereby demonstrating its effectiveness for imbalanced traffic identification. (C) 2016 Elsevier B.V. All rights reserved.
机译:作为监视Internet应用程序活动的基本方法,流量识别对于Internet管理和安全性非常重要。互联网流量数据自然呈现不平衡的分布,但是研究界很少考虑此问题。基于数据引力的分类(DGC)是用于处理不平衡数据集的新分类模型,我们在先前的研究中提出了不平衡DGC(IDGC)模型。在本研究中,我们开发了一种基于IDGC的模型来解决不平衡的Internet流量识别问题。首先,我们从三个原始交通数据集中构造了六个不平衡交通数据集,然后根据数据包大小提取了它们的早期特征。在识别实验中,我们比较了六种标准算法(包括DGC)和四种不平衡算法与IDGC的性能。实验结果表明,标准分类模型在交通数据不平衡的情况下可以达到较高的精度,但其不平衡性能不佳,推广性也存在问题。相比之下,IDGC在不平衡的性能指标方面表现良好且稳定,因此证明了其在不平衡流量识别中的有效性。 (C)2016 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Computer Communications》 |2017年第1期|177-189|共13页
  • 作者单位

    Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China;

    Univ Otago, Dept Comp Sci, Dunedin, New Zealand;

    Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China;

    Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Data gravitation; Imbalanced data; Machine learning; Traffic identification;

    机译:数据引力;数据失衡;机器学习;交通识别;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号