Toward an efficient and scalable feature selection approach for internet traffic classification

Adil Fahad; Zahir Tari; Ibrahim Khalil; Ibrahim Habib; Hussein AInuweiri

首页> 外文期刊>Computer networks >Toward an efficient and scalable feature selection approach for internet traffic classification

【24h】

Toward an efficient and scalable feature selection approach for internet traffic classification

机译：寻求一种有效且可扩展的特征选择方法来进行互联网流量分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

There is significant interest in the network management and industrial security community about the need to identify the "best" and most relevant features for network traffic in order to properly characterize user behaviour and predict future traffic. The ability to eliminate redundant features is an important Machine Learning (ML) task because it helps to identify the best features in order to improve the classification accuracy as well as to reduce the computational complexity related to the construction of the classifier. In practice, feature selection (FS) techniques can be used as a preprocessing step to eliminate irrelevant features and as a knowledge discovery tool to reveal the "best" features in many soft computing applications. In this paper, we investigate the advantages and disadvantages of such FS techniques with new proposed metrics (namely goodness, stability and similarity). We continue our efforts toward developing an integrated FS technique that is built on the key strengths of existing FS techniques. A novel way is proposed to identify efficiently and accurately the "best" features by first combining the results of some well-known FS techniques to find consistent features, and then use the proposed concept of support to select a smallest set of features and cover data optimality. The empirical study over ten high-dimensional network traffic data sets demonstrates significant gain in accuracy and improved run-time performance of a classifier compared to individual results produced by some well-known FS techniques.

机译：对于网络管理和工业安全界，人们非常需要确定“最佳”和最相关的网络流量功能，以便正确表征用户行为并预测未来流量。消除冗余特征的能力是一项重要的机器学习（ML）任务，因为它有助于识别最佳特征，从而提高分类精度并降低与分类器构造相关的计算复杂性。在实践中，功能选择（FS）技术可以用作消除不相关功能的预处理步骤，并可以作为知识发现工具来揭示许多软计算应用程序中的“最佳”功能。在本文中，我们使用新提出的度量标准（即良好性，稳定性和相似性）来研究此类FS技术的优缺点。我们将继续努力开发基于现有FS技术的关键优势的集成FS技术。通过首先结合一些著名的FS技术的结果以找到一致的特征，然后使用提议的支持概念来选择最小的特征集和覆盖数据，提出了一种新颖的方法来有效，准确地识别“最佳”特征。最优性。对十个高维网络流量数据集的经验研究表明，与某些知名的FS技术产生的单个结果相比，分类器的准确性显着提高，并且运行时性能得到了改善。

著录项

来源
《Computer networks》 |2013年第9期|2040-2057|共18页
作者
Adil Fahad; Zahir Tari; Ibrahim Khalil; Ibrahim Habib; Hussein AInuweiri;
展开▼
作者单位

School of Computer Science and Information Technology, RMIT University, Melbourne, Australia;

School of Computer Science and Information Technology, RMIT University, Melbourne, Australia;

School of Computer Science and Information Technology, RMIT University, Melbourne, Australia;

Department of Electrical Engineering, City University of New York, United States;

Electrical & Computer Engineering Program, Texas A& M University at Qatar, Doha, Qatar;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Feature selection; Metrics; Traffic classification;

机译：特征选择度量流量分类;
入库时间 2022-08-18 02:12:44

相似文献

外文文献
中文文献
专利

1. An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification [J] . Hongtao Shi, Hongping Li, Dan Zhang, Computer networks . 2018,第FEBa26期

机译：基于深度学习和特征选择技术的高效交通特征生成方法
2. Optimizing Feature Selection for Efficient Encrypted Traffic Classification: A Systematic Approach [J] . Meng Shen, Yiting Liu, Liehuang Zhu, IEEE Network . 2020,第4期

机译：优化有效加密流量分类的功能选择：系统方法
3. Ensemble Feature Selection and Classification of Internet Traffic using XGBoost Classifier [J] . N Manju, B S Harish, V Prajwal International Journal of Computer Network and Information Security . 2019,第7期

机译：使用XGBoost分类器对互联网流量进行特征选择和分类
4. A novel feature selection technique based on Roach Infestation Optimization for Internet Traffic Classification [C] . Dalila Boughaci, Fatma Belaidi, Imene Kerkouche International Conference on Computer and Information Sciences . 2020

机译：基于蟑螂侵扰优化的网络流量分类新特征选择技术
5. A systematic approach to feature selection for encrypted network traffic classification. [D] . Semeniuk, Trevor John. 2013

机译：一种用于加密网络流量分类的特征选择的系统方法。
6. A scale space approach for unsupervised feature selection in mass spectra classification for ovarian cancer detection [O] . Michele Ceccarelli, Antonio dAcierno, Angelo Facchiano 2009

机译：质谱分类中用于卵巢癌检测的无监督特征选择的尺度空间方法
7. A New Feature Selection Method for Internet Traffic Classification Using ML [O] . Zhen Liu, Qiong Liu 2012

机译：一种基于ML的互联网流量分类的特征选择新方法

Toward an efficient and scalable feature selection approach for internet traffic classification

摘要

著录项

相似文献

相关主题

期刊订阅