首页> 外文会议>Systems of Signal Synchronization, Generating and Processing in Telecommunications >Lightweight Machine Learning Classifiers of IoT Traffic Flows
【24h】

Lightweight Machine Learning Classifiers of IoT Traffic Flows

机译:物联网流量的轻量级机器学习分类器

获取原文
获取外文期刊封面目录资料

摘要

IoT traffic flows have different from traditional devices statistics and their classification become an important task because of the exponentially growing number of smart devices. Conventional Deep Packet Inspection systems that rely on inspection of open fields in TLS and DNS packets, and the trend of encrypting the open fields makes machine learning based systems the only viable option for future networks. Moreover, computational complexity of models becomes crucial for large-scale operations. In this work, we investigated whether simple models, such as Logistic Regression, SVM with linear kernel, and a Decision Tree, have suitable for real-world deployments performance of multiclass classification of IoT traces, given thoughtful features engineering. We introduced a new flow feature of categorical type that describes a set of TCP-flag fields within a flow. In addition, removal of correlated features and feature space transformation via PCA method showed their usefulness in terms of prediction complexity reduction. In order to account for online classification mode, we limited the maximal number of packets within a flow to 10. Moreover, to estimate the upper-bound performance with given features, we compared the simple algorithms with Random Forest, Gradient Boosting and a feed-forward neural network. We performed 4-fold cross-validation of models by metrics Accuracy and F1-measure. The test results demonstrated that the introduced feature increases F1-measure for logistic regression from 99.1% in the base case to 99.6%, thus closely approaching more computationally expensive models. Overall, the evaluation results demonstrated feasibility of a lightweight model for IoT flow classification task with the suitable for a practical deployment performance.
机译:物联网流量与传统设备统计数据不同,由于智能设备数量呈指数级增长,因此其分类成为一项重要任务。依靠对TLS和DNS数据包中的开放字段进行检查的传统深度数据包检查系统,以及对开放字段进行加密的趋势,使得基于机器学习的系统成为未来网络的唯一可行选择。此外,模型的计算复杂性对于大规模操作变得至关重要。在这项工作中,我们研究了简单的模型(例如Logistic回归,具有线性内核的SVM和决策树)是否具有适用于IoT跟踪的多类分类的实际部署性能(考虑到周到的功能工程)。我们引入了分类类型的新流功能,该功能描述了流中的一组TCP标志字段。此外,通过PCA方法去除相关特征和特征空间变换显示了其在降低预测复杂度方面的有用性。为了考虑在线分类模式,我们将一个流中的最大数据包数限制为10。此外,为了估算具有给定功能的上限性能,我们将简单算法与随机森林,梯度提升和Feed-前向神经网络。我们通过度量准确性和F1度量对模型进行了4倍交叉验证。测试结果表明,引入的功能将逻辑回归的F1量度从基本情况下的99.1%增加到了99.6%,从而更加接近了计算成本更高的模型。总体而言,评估结果证明了针对物联网流分类任务的轻量级模型的可行性,并适合实际部署性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号