首页> 外文期刊>Intelligent automation and soft computing >Oversampling Methods Combined Clustering and Data Cleaning for Imbalanced Network Data
【24h】

Oversampling Methods Combined Clustering and Data Cleaning for Imbalanced Network Data

机译:过采样方法组合聚类和数据清洁以进行不平衡网络数据

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In network anomaly detection, network traffic data are often imbalanced, that is, certain classes of network traffic data have a large sample data volume while other classes have few, resulting in reduced overall network traffic anomaly detection on a minority class of samples. For imbalanced data, researchers have proposed the use of oversampling techniques to balance data sets; in particular, an oversampling method called the SMOTE provides a simple and effective solution for balancing data sets. However, current oversampling methods suffer from the generation of noisy samples and poor information quality. Hence, this study proposes an oversampling method for imbalanced network traffic data that combines the SMOTE algorithm and FINCH clustering algorithm to filter out minority sample clusters, proposes a scheme to allocate the number of synthetic samples per cluster according to the clustering sparsity and sample weight, and finally uses multi-layer sensors for noisy sample cleaning during sampling. We compare the proposed method with other oversampling methods, verifying that a data set processed using this method works better in network traffic anomaly detection.
机译:在网络异常检测中,网络流量数据通常是不平衡的,即某些类别的网络流量数据具有大的样本数据量,而其他类别几乎没有,导致在少数类别类别上减少整体网络流量异常检测。对于不平衡的数据,研究人员提出了使用过采样技术来平衡数据集;特别地,称为SMOTE的过采样方法为平衡数据集提供了一种简单有效的解决方案。然而,目前的过采样方法遭受嘈杂的样本和信息质量差的产生。因此,本研究提出了一种用于非衡度网络流量数据的过采样方法,该方法将少数算法和芬清群集算法滤除少数群体集群,提出了根据聚类稀疏性和样品重量分配每簇合成样本数量的方案,最后使用多层传感器进行采样期间进行嘈杂的样品清洁。我们将提出的方法与其他过采样方法进行比较,验证使用此方法处理的数据集在网络流量异常检测方面更好地工作。

著录项

  • 来源
    《Intelligent automation and soft computing》 |2020年第5期|1139-1155|共17页
  • 作者单位

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100000 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100000 Peoples R China;

    Univ Melbourne Sch Comp & Informat Syst Cloud Comp & Distributed Syst Lab Melbourne Vic 3000 Australia;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100000 Peoples R China;

    China Elect Technol Grp Corp Res Inst 54 Shijiazhuang 050000 Hebei Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100000 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Imbalanced learning; oversampling; SMOTE; network anomaly detection;

    机译:学习的不平衡;过采样;Smote;网络异常检测;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号