【24h】

A Fragment Classification Method Depending on Data Type

机译:取决于数据类型的片段分类方法

获取原文
获取原文并翻译 | 示例

摘要

Data fragment classification is an important problem in many fields, such as intrusion detection, reverse engineering, data recovery, digital forensics and so on. Most of the existing methods try to classify the fragment depending on file type. But the results are poor, because compound file types can contain many other file types, and some file types use the similar data encoding scheme. In this paper, a classification method depending on data type is promoted. In the method the fragment needed to be classified is given a data type instead of file type. First a fragment set including many common data types is created, then the byte frequency distribution and entropy are extracted as features, after that a classifier is built by using those features in training set and SVM algorithm, last the classifier is used to classify the data fragments. The experiment results show that the accuracy of the proposed method is 88.58%, which achieved a 21.2% growth compared with the traditional way.
机译:数据片段分类是许多领域的重要问题,例如入侵检测,逆向工程,数据恢复,数字取证等。现有的大多数方法都尝试根据文件类型对片段进行分类。但是结果很差,因为复合文件类型可以包含许多其他文件类型,并且某些文件类型使用类似的数据编码方案。本文提出了一种基于数据类型的分类方法。在该方法中,需要分类的片段被赋予了数据类型而不是文件类型。首先创建包含许多常见数据类型的片段集,然后提取字节频率分布和熵作为特征,然后使用训练集中的那些特征和SVM算法构建分类器,最后使用分类器对数据进行分类碎片。实验结果表明,该方法的准确度为88.58%,与传统方法相比增长了21.2%。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号