Automatic protocol feature word construction based on machine learning

Abstract

Automatic protocol reverse engineering of application protocols is becoming increasingly important for applications such as protocol analyzers, penetration testing, and intrusion prevention and detection. Unfortunately, many techniques for extracting the message format specifications of unknown applications are limited by a lack of prior information or are time-consuming. Protocol feature words are byte subsequences within the traffic payload that help distinguish application protocols. This paper proposes a new approach for extracting the protocol message format specifications of unknown applications, based on the Latent Dirichlet Allocation (LDA) model and the Huffman Tree Support Vector Machine (HT-SVM). First, keywords are extracted with the LDA model, a machine learning technique that extracts the thematic structure (topics) of a document collection. Second, the HT-SVM method is applied to construct the feature words from the extracted keywords. The proposed approach is implemented and evaluated by inferring the message format specifications of the SMTP binary protocol. Experimental results show that the approach accurately parses and infers the SMTP protocol with a high recall rate.
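
The idea of a feature word as a byte subsequence of the payload can be illustrated with a minimal Python sketch. It is not the LDA or HT-SVM procedure of the paper: it only enumerates byte n-grams and keeps those that recur across payloads, which is one plausible way to produce candidates for a later selection step. The function names, the n-gram length, the support threshold and the toy payloads are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(payload: bytes, n: int):
    """Yield every contiguous n-byte subsequence of a payload."""
    for i in range(len(payload) - n + 1):
        yield payload[i:i + n]


def candidate_feature_words(payloads, n=4, min_support=0.5):
    """Return n-grams present in at least `min_support` of the payloads.

    Document frequency is used: each payload counts an n-gram at most once.
    This is a plain frequency filter (an assumption of this sketch), not
    the LDA/HT-SVM pipeline described in the abstract.
    """
    doc_freq = Counter()
    for p in payloads:
        doc_freq.update(set(ngrams(p, n)))
    threshold = min_support * len(payloads)
    return {g: c for g, c in doc_freq.items() if c >= threshold}


# Toy SMTP-like payloads, invented for illustration (not the paper's data).
smtp = [
    b"HELO client.example\r\n",
    b"MAIL FROM:<a@example.org>\r\n",
    b"RCPT TO:<b@example.org>\r\n",
    b"MAIL FROM:<c@example.org>\r\n",
]
words = candidate_feature_words(smtp, n=4, min_support=0.5)
```

In this toy input the command keyword `b"MAIL"` survives the filter, but so does `b"exam"`, which comes from message content rather than from the protocol. A raw frequency filter therefore yields candidates only and still needs a further selection step.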

Bibliographic record

Source
IEEE International Conference on Progress in Informatics and Computing | 2015 | pp. 93-97 | 5 pages
Authors
Haifeng Li; Bin Zhang; Bo Shuai; Jian Wang; Chaojing Tang
Original format: PDF
Keywords
HT-SVM; LDA; feature word; protocol reverse

Similar literature

1. A vision-based method for automatic tracking of construction machines at nighttime based on deep learning illumination enhancement [J]. Xiao Bo, Lin Qiang, Chen Yuan. Automation in Construction. 2021, issue Jula.
2. Multi-Step Protocol for Automatic Evaluation of Docking Results Based on Machine Learning Methods-A Case Study of Serotonin Receptors 5-HT6 and 5-HT7 [J]. Smusz Sabina, Mordalski Stefan, Witek Jagna. Journal of Chemical Information and Modeling. 2015, issue 4.
3. Reinforcement learning with automatic basis construction based on isometric feature mapping [J]. Zhenhua Huang, Xin Xu, Lei Zuo. Information Sciences: An International Journal. 2014.
4. Automatic protocol feature word construction based on machine learning [C]. Haifeng Li, Bin Zhang, Bo Shuai. IEEE International Conference on Progress in Informatics and Computing. 2015.
5. Automatic Liver and Tumor Segmentation from CT Scan Images using Gabor Feature and Machine Learning Algorithms [D]. Shrestha, Ujjwal. 2018.
6. Automatic identification of myopic maculopathy related imaging features in optic disc region via machine learning methods [O]. Yuchen Du, Qiuying Chen, Ying Fan. 2021.
7. Word based off-line handwritten Arabic classification and recognition. Design of automatic recognition system for large vocabulary offline handwritten Arabic words using machine learning approaches [O]. AlKhateeb Jawad Hasan Yasin. 2010.
