首页> 外文学位 >A systematic approach to feature selection for encrypted network traffic classification.
【24h】

A systematic approach to feature selection for encrypted network traffic classification.

机译:一种用于加密网络流量分类的特征选择的系统方法。

获取原文
获取原文并翻译 | 示例

摘要

Most organizations, including the Canadian Department of National Defence, allow encrypted traffic on their networks so employees can perform transactions such as personal banking. By allowing legitimate encrypted traffic, it has been shown that non-authorized or malicious traffic in disguise may also bypass security perimeters. Recent research has focused on developing faster and more accurate methods of detecting nonauthorised use by classifying this encrypted traffic and many successes have been demonstrated. Feature-based statistical classification has produced positive results when applied to encrypted traffic and various methods have been used to select the feature sets. However, a literature survey did not find evidence of a systematic approach to select and assess the predictive value of feature sets for use in encrypted traffic classification.;The objective of this research was to develop a general-purpose method of selecting feature subsets that result in high prediction accuracy when used for encrypted traffic classification. The methodology developed uses the fast orthogonal search (FOS) algorithm to select feature subsets with discriminative power. Success was defined in terms of the prediction accuracy of the subset of features selected by the FOS algorithm, as compared to subjectively selected features and features selected by the Best First algorithm. In all experiments the FOS algorithm achieved comparable or better classification results with substantially reduced feature subsets. In the final experiment the FOS algorithm selected a 12-feature subset from a set of 2,839 features. This subset achieved a receiver operating characteristic (ROC) area under the curve (AUC) of 0.9898 compared to a benchmark AUC of 0.9893 achieved using a 44-feature primary set. This translates to 106 fewer errors using a subset of 32 fewer features, and an 81% reduction in computation time for classification.
机译:包括加拿大国防部在内的大多数组织都允许其网络上的加密流量,以便员工可以执行个人银行业务之类的交易。通过允许合法的加密流量,已表明变相的未经授权或恶意流量也可能绕过安全范围。最近的研究集中在通过对这种加密的流量进行分类来开发出更快,更准确的方法来检测未经授权的使用,并且已经证明了许多成功。基于特征的统计分类在应用于加密流量时产生了积极的结果,并且已使用各种方法来选择特征集。但是,文献调查并未找到证据来证明有系统的方法来选择和评估用于加密流量分类的特征集的预测价值。;本研究的目的是开发一种通用的方法来选择产生结果的特征子集用于加密流量分类时,具有较高的预测精度。开发的方法使用快速正交搜索(FOS)算法来选择具有判别能力的特征子集。与主观选择的特征和Best First算法选择的特征相比,FOS算法选择的特征子集的预测准确性定义了成功。在所有实验中,FOS算法在特征子集大大减少的情况下均达到了可比或更好的分类结果。在最终实验中,FOS算法从2839个特征中选择了12个特征子集。与使用44个特征的主要集合实现的基准AUC为0.9893相比,此子集在曲线(AUC)为0.9898时获得了接收器工作特征(ROC)面积。使用32个较少特征的子集,可以减少106个错误,并且分类的计算时间减少81%。

著录项

  • 作者

    Semeniuk, Trevor John.;

  • 作者单位

    Royal Military College of Canada (Canada).;

  • 授予单位 Royal Military College of Canada (Canada).;
  • 学科 Electrical engineering.;Computer science.
  • 学位 M.A.Sc.
  • 年度 2013
  • 页码 99 p.
  • 总页数 99
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号