首页> 外文会议>IEEE International Conference on Parallel and Distributed Systems >Leveraging Inner-Connection of Message Sequence for Traffic Classification: A Deep Learning Approach
【24h】

Leveraging Inner-Connection of Message Sequence for Traffic Classification: A Deep Learning Approach

机译:利用消息序列的内部连接进行流量分类:一种深度学习方法

获取原文

摘要

Classifying traffic flows into source applications is of great value for intelligent network management, which can help to detect malicious attacks, monitor the network, optimize network behaviors and then improve user experience, etc. However, to achieve high-accuracy traffic classification, especially in real time, is very challenging due to very complicated behaviors of traffic flows where network applications could often transmit traffics with encryption at randomized port numbers under highly dynamic network conditions. In this paper, by collecting extensive application traffic flows at the exit router of Shanghai Maritime University (the traffic rate can reach up to 7 GB/s at peak time), we identify that there is a very distinct characteristic in inner-connection of message (grouped by single or multiple consecutive TCP packets) sequence for different application flows. We then propose our traffic classification algorithm, which essentially adopts a Long Short-Term Memory (LSTM) neural network to output a classifier with message sequence vector (not necessarily covering all messages) of a traffic flow as the training input, to conduct online traffic flow classification. Extensive simulations are conduced considering varied training data size and diverse source applications, and an average about 97 % accuracy on per-flow classification can be achieved.
机译:将流量分类为源应用程序对于智能网络管理具有重要价值,它可以帮助检测恶意攻击,监视网络,优化网络行为,然后改善用户体验等。但是,要实现高精度的流量分类,尤其是在由于业务流的行为非常复杂,因此实时性非常具有挑战性,在这种情况下,网络应用程序经常可以在高度动态的网络条件下以随机端口号加密传输业务。在本文中,通过收集上海海事大学出口路由器的大量应用流量(在高峰时间流量可以达到7 GB / s),我们发现消息的内部连接有一个非常明显的特征(按单个或多个连续的TCP数据包分组)不同应用程序流的顺序。然后,我们提出我们的流量分类算法,该算法本质上采用长短期记忆(LSTM)神经网络来输出带有流量流的消息序列向量(不一定覆盖所有消息)的分类器作为训练输入,以进行在线流量流分类。考虑到不同的训练数据大小和不同的源应用程序,可以进行广泛的模拟,并且每个流分类的平均精度约为97%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号