【24h】

File Block Classification by Support Vector Machine

机译:支持向量机对文件块的分类

获取原文

摘要

Retrieval of files without the support of file system structures is arguably essential for digital forensics. Files are typically stored as sequences of data blocks, which have to be reconstructed in the retrieval process. This is commonly performed, among other approaches, through file carving, in general detecting the original block sequences by means of signatures of known headers and footers of files. Of course, this creates challenges with fragmented files, where blocks belonging to different files may be interleaved. Ways to classify file blocks into file types relying on their content may provide a support to achieve a successful reconstruction. We propose to classify file blocks using Support Vector Machines (SVMs), and we do so by studying in-depth the impact of an appropriate selection of the features used in the classification process. We analyze several potential features and test their performance over a large and representative collection of file blocks and file types. We find out that SVM classifiers can achieve a good accuracy and that a specific type of features (based on byte frequency distribution) performs well across almost all of the examined file types.
机译:对于数字取证来说,在没有文件系统结构支持的情况下检索文件可能是必不可少的。文件通常存储为数据块序列,必须在检索过程中对其进行重建。除其他方法外,这通常是通过文件雕刻来执行的,通常是通过已知文件头和页脚的签名来检测原始块序列。当然,这给碎片文件带来了挑战,在碎片文件中,属于不同文件的块可能会交错插入。根据文件块的内容将文件块分类为文件类型的方法可以为成功重建提供支持。我们建议使用支持向量机(SVM)分类的文件块,并通过我们的深入研究在分类过程中使用的功能,适当的选择的影响,这样做。我们分析了一些潜在功能,并在大量具有代表性的文件块和文件类型集合上测试了它们的性能。我们发现,SVM分类器可以达到较高的准确性,并且特定类型的功能(基于字节频率分布)在几乎所有检查的文件类型上都能表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号