A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark

Yong Wang; Wenlong Ke; Xiaoling Tao

首页> 外文期刊>Information >A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark

【24h】

A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark

机译：基于Spark的大规模网络流量分类的特征选择方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Currently, with the rapid increasing of data scales in network traffic classifications, how to select traffic features efficiently is becoming a big challenge. Although a number of traditional feature selection methods using the Hadoop-MapReduce framework have been proposed, the execution time was still unsatisfactory with numeral iterative computations during the processing. To address this issue, an efficient feature selection method for network traffic based on a new parallel computing framework called Spark is proposed in this paper. In our approach, the complete feature set is firstly preprocessed based on Fisher score, and a sequential forward search strategy is employed for subsets. The optimal feature subset is then selected using the continuous iterations of the Spark computing framework. The implementation demonstrates that, on the precondition of keeping the classification accuracy, our method reduces the time cost of modeling and classification, and improves the execution efficiency of feature selection significantly.

机译：当前，随着网络流量分类中数据规模的迅速增加，如何有效地选择流量特征成为一个巨大的挑战。尽管已经提出了许多使用Hadoop-MapReduce框架的传统特征选择方法，但是在处理过程中使用数字迭代计算的执行时间仍然不能令人满意。为了解决这个问题，本文提出了一种基于新型并行计算框架Spark的高效的网络流量特征选择方法。在我们的方法中，首先基于Fisher分数对完整的特征集进行预处理，然后对子集采用顺序正向搜索策略。然后，使用Spark计算框架的连续迭代来选择最佳特征子集。实例表明，在保持分类精度的前提下，我们的方法减少了建模和分类的时间成本，显着提高了特征选择的执行效率。

著录项

来源
《Information》 |2016年第1期|共页
作者
Yong Wang; Wenlong Ke; Xiaoling Tao;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类一般工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. CLASSIFICATION OF ROAD TRAFFIC CONDITIONS BASED ON TEXTURE FEATURES OF TRAFFIC IMAGES USING NEURAL NETWORKS [J] . Teresa PAMU?A Scientific Journal of Silesian University of Technology: Series Transport . 2016,第6期

机译：基于神经网络的交通图像纹理特征的道路交通状况分类
2. An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification [J] . Hongtao Shi, Hongping Li, Dan Zhang, Computer networks . 2018,第FEBa26期

机译：基于深度学习和特征选择技术的高效交通特征生成方法
3. Classification of flower species by using features extracted from the intersection of feature selection methods in convolutional neural network models [J] . Measurement . 2020,第期

机译：使用从卷积神经网络模型中的特征选择方法中提取的特征来分类花卉种类
4. Spark-Based Feature Selection Algorithm of Network Traffic Classification [C] . Wenlong Ke, Yong Wang, Xiaochun Lei, International Conference on Computational Intelligence and Security . 2017

机译：基于Spark的网络流量分类特征选择算法
5. Statistical model-based methods for observation selection in wireless sensor networks and for feature selection in classification. [D] . Qi, Qi. 2012

机译：基于统计模型的方法用于无线传感器网络中的观察选择和分类中的特征选择。
6. A Three-Stage Teacher Student Neural Networks and Sequential Feed Forward Selection-Based Feature Selection Approach for the Classification of Autism Spectrum Disorder [O] . Naseer Ahmed Khan, Samer Abdulateef Waheeb, Atif Riaz, 2020

机译：三阶段教师学生神经网络和基于顺序馈线的基于顺序馈线的特征选择方法用于自闭症谱系障碍分类
7. A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark [O] . Yong Wang, Wenlong Ke, Xiaoling Tao 2016

机译：基于spark的大规模网络流量分类特征选择方法
8. Salient Feature Selection Using Feed-Forward Neural Networks and Signal- to-Noise Ratios with a Focus Toward Network Threat Detection and Classification. [R] . K. L. Moore 2014

机译：使用前馈神经网络和信噪比的突出特征选择，重点是网络威胁检测和分类。

A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark

摘要

著录项

相似文献

相关主题

期刊订阅