...
首页> 外文期刊>Big Data >An Ensemble-Based Scalable Approach for Intrusion Detection Using Big Data Framework
【24h】

An Ensemble-Based Scalable Approach for Intrusion Detection Using Big Data Framework

机译:基于集合的可扩展方法,用于使用大数据框架的入侵检测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this study, we set up a scalable framework for large-scale data processing and analytics using the big data framework. The popular classification methods are implemented, tuned, and evaluated by using intrusion datasets. The objective is to select the best classifier after optimizing the hyper-parameters. We observed that the decision tree (DT) approach outperforms compared with other methods in terms of classification accuracy, fast training time, and improved average prediction rate. Therefore, it is selected as a base classifier in our proposed ensemble approach to study class imbalance. As the intrusion datasets are imbalanced, most of the classification techniques are biased toward the majority class. The misclassification rate is more in the case of the minority class. An ensemble-based method is proposed by using K-Means, RUSBoost, and DT approaches to mitigate the class imbalance problem; empirically investigate the impact of class imbalance on classification approaches' performance; and compare the result by using popular performance metrics such as Balanced Accuracy, Matthews Correlation Coefficient, and F-Measure, which are more suitable for the assessment of imbalanced datasets.
机译:在这项研究中,我们使用大数据框架为大规模数据处理和分析设置可扩展框架。通过使用入侵数据集来实现,调整和评估流行的分类方法。目标是在优化超参数后选择最佳分类器。我们观察到决策树(DT)接近胜过与其他方法在分类精度,快速训练时间和改进的平均预测率方面相比。因此,在我们建议的集合方法中选择了作为基础分类器来研究类别不平衡。由于入侵数据集是不平衡的,大多数分类技术都偏向于多数类。在少数阶级的情况下,错误分类率更多。通过使用K-Means,Rusboost和DT方法提出基于合奏的方法来减轻类别不平衡问题;经验研究阶级失衡对分类方法的影响;并通过使用均衡精度,Matthews相关系数和F测量等流行性能指标进行比较结果,这些测量更适合于评估不平衡数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号