首页> 外文会议>IEEE International Conference on Progress in Informatics and Computing >Automatic protocol feature word construction based on machine learning
【24h】

Automatic protocol feature word construction based on machine learning

机译:自动协议特征基于机器学习的词结构

获取原文

摘要

Automatic protocol reverse engineering for application protocol is becoming more and more important for many applications such as application protocol analyzer, penetration testing, intrusion prevention and detection. Unfortunately, many techniques for extracting the protocol message format specifications of unknown applications often have some limitations for few priori information or the time-consuming problem. Protocol feature words are byte subsequences within traffic payload that could help distinguish application protocols. In this paper, a new approach is proposed for extracting the protocol message format specifications of unknown applications which is based on the Latent Dirichlet Allocation (LDA) model and Huffman Tree Support Vector Machine (HT-SVM). Firstly, the key words are extracted by utilizing the LDA model, which is a kind of machine learning in document library to extract the theme structure named topic. Secondly, the HT-SVM method is applied to constructing the feature words based on the above process. The proposed approach is implemented and evaluated to infer message format specifications of SMTP binary protocol. Experimental results show that the approach accurately parses and infers SMTP protocol with highly recall rate.
机译:应用协议的自动协议逆向工程对于许多应用程序,例如应用协议分析仪,穿透测试,入侵防御和检测等许多应用越来越重要。遗憾的是,用于提取不知名应用程序的协议消息格式规范的许多技术通常具有一些先验信息或耗时问题的一些限制。协议功能单词是流量有效载荷中的字节子篇文章,可以帮助区分应用程序协议。在本文中,提出了一种新方法来提取基于潜在Dirichlet分配(LDA)模型和霍夫曼树支持向量机(HT-SVM)的未知应用程序的协议消息格式规范。首先,通过利用LDA模型来提取关键词,该模型是文档库中的一种机器学习,以提取名为主题的主题结构。其次,将HT-SVM方法应用于基于上述过程构建特征词。实现并评估所提出的方法,以推断SMTP二进制协议的消息格式规范。实验结果表明,该方法具有高度召回速率的准确解析和Infers SMTP协议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号