首页> 外文会议>IEEE International Conference on Progress in Informatics and Computing >Automatic protocol feature word construction based on machine learning
【24h】

Automatic protocol feature word construction based on machine learning

机译:基于机器学习的协议特征词自动构建

获取原文

摘要

Automatic protocol reverse engineering for application protocol is becoming more and more important for many applications such as application protocol analyzer, penetration testing, intrusion prevention and detection. Unfortunately, many techniques for extracting the protocol message format specifications of unknown applications often have some limitations for few priori information or the time-consuming problem. Protocol feature words are byte subsequences within traffic payload that could help distinguish application protocols. In this paper, a new approach is proposed for extracting the protocol message format specifications of unknown applications which is based on the Latent Dirichlet Allocation (LDA) model and Huffman Tree Support Vector Machine (HT-SVM). Firstly, the key words are extracted by utilizing the LDA model, which is a kind of machine learning in document library to extract the theme structure named topic. Secondly, the HT-SVM method is applied to constructing the feature words based on the above process. The proposed approach is implemented and evaluated to infer message format specifications of SMTP binary protocol. Experimental results show that the approach accurately parses and infers SMTP protocol with highly recall rate.
机译:对于许多应用程序,例如应用程序协议分析器,渗透测试,入侵防御和检测,用于应用程序协议的自动协议逆向工程变得越来越重要。不幸的是,许多用于提取未知应用程序的协议消息格式规范的技术通常对一些先验信息或耗时的问题有一些限制。协议特征字是流量有效载荷内的字节子序列,可以帮助区分应用程序协议。本文提出了一种基于潜在狄利克雷分配(LDA)模型和霍夫曼树支持向量机(HT-SVM)的未知应用程序提取协议消息格式规范的新方法。首先,利用LDA模型提取关键词,该模型是文档库中的一种机器学习方法,用于提取名为topic的主题结构。其次,基于上述过程,将HT-SVM方法应用于特征词的构造。实施并评估了提出的方法,以推断SMTP二进制协议的消息格式规范。实验结果表明,该方法能够以较高的查全率准确地解析和推断SMTP协议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号