...
首页> 外文期刊>Computer Communications >Optimization scheme for intrusion detection scheme GBDT in edge computing center
【24h】

Optimization scheme for intrusion detection scheme GBDT in edge computing center

机译:边缘计算中心入侵​​检测方案GBDT的优化方案

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Combination of edge computing technologies and machine learning help to put edge intelligence into practice. Industrial Internet of Things (IIoT) is one of its most typical applications. But this system can be easily attacked in the process of using edge computing center to process localized perception data. Intrusion detection technologies based on machine learning provide strong security for edge computing center, in which the most widely used is gradient boosting decision tree (i.e., GBDT). But still this model faces with problems such as imbalanced data, high dimensional data characteristics, and low efficiency of parameter optimization. To solve these problems, this paper proposes an optimization scheme for GBDT to improve its detection precision and training efficiency. First, to solve the problem of imbalanced data in data set, we propose a margin synthetic minority oversampling technique (i.e., MSMOTE), which can expand the non-noise data with less sample size, namely, small sample, to ensure equilibrium distribution of data. Second, to lower the data feature dimensionality, we propose a recursive feature elimination-hierarchy cross validation algorithm (i.e., RFE-HCV). The new algorithm eliminates redundant data features recursively according to feature weight, to strengthen the relationship between features and goals. It also designs hierarchy system to ensure equal proportionment of data category (attack category) in training set and testing set at cross validation stage. Next, in order to improve the efficiency of parameter optimization in model training process, we develop a flexible grid search algorithm (i.e., FGS) to improve retrieval efficiency of optimum parameters. Finally, the detailed experimental results show that our new scheme ensures data balance in dataset and eliminates redundant data features, and helps the efficiency of parameter optimization increase by three times. Moreover, the new scheme defends against intrusion more effectively.
机译:边缘计算技术和机器学习的组合有助于将边缘智能放入实践中。工业互联网(IIT)是其最典型的应用之一。但是该系统可以在使用边缘计算中心处理本地化感知数据的过程中容易地攻击。基于机器学习的入侵检测技术为边缘计算中心提供了强大的安全性,其中最广泛使用的是梯度升压决策树(即,GBDT)。但是,这种型号面临着不平衡数据,高维数据特性和低参数优化效率等问题。为了解决这些问题,本文提出了一种用于GBDT的优化方案,以提高其检测精度和培训效率。首先,为了解决数据集中的不平衡数据问题,我们提出了一种利润合成少数群体过采样技术(即MSMOTE),可以扩展具有更少样本大小的非噪声数据,即小样本,以确保平衡分布数据。其次,为了降低数据特征维度,我们提出了一种递归特征消除 - 层次结构交叉验证算法(即,RFE-HCV)。新算法根据特征权重递归地消除了冗余数据特征,以增强特征与目标之间的关系。它还设计了层次结构系统,以确保训练集和跨验证阶段的测试集中的数据类别(攻击类别)等于相同的数据类别(攻击类别)。接下来,为了提高模型训练过程中参数优化的效率,我们开发灵活的网格搜索算法(即,FGS),以提高最佳参数的检索效率。最后,详细的实验结果表明,我们的新方案可确保数据集中的数据余额,并消除冗余数据功能,并有助于参数优化的效率增加三次。此外,新方案更有效地防止了入侵。

著录项

  • 来源
    《Computer Communications》 |2021年第2期|136-145|共10页
  • 作者单位

    Ocean Univ China Coll Informat Sci & Engn Qingdao 266100 Peoples R China|Qingdao Univ Coll Comp Sci & Technol Qingdao 266071 Peoples R China;

    Ocean Univ China Coll Informat Sci & Engn Qingdao 266100 Peoples R China;

    Ocean Univ China Coll Informat Sci & Engn Qingdao 266100 Peoples R China;

    Qingdao Univ Coll Comp Sci & Technol Qingdao 266071 Peoples R China;

    Qingdao Univ Coll Comp Sci & Technol Qingdao 266071 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Intrusion detection; Gradient boosting decision tree; Machine learning; Ensemble learning;

    机译:入侵检测;渐变升压决策树;机器学习;集合学习;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号