【24h】

Optimizing Seed Inputs in Fuzzing with Machine Learning

机译:通过机器学习优化模糊测试中的种子输入

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The success of a fuzzing campaign is heavily de-pending on the quality of seed inputs used for test generation. It is however challenging to compose a corpus of seed inputs that enable high code and behavior coverage of the target program, especially when the target program requires complex input formats such as PDF files. We present a machine learning based framework to improve the quality of seed inputs for fuzzing programs that take PDF files as input. Given an initial set of seed PDF files, our framework utilizes a set of neural networks to 1) discover the correlation between these PDF files and the execution in the target program, and 2) leverage such correlation to generate new seed files that more likely explore new paths in the target program. Our experiments on a set of widely used PDF viewers demonstrate that the improved seed inputs produced by our framework could significantly increase the code coverage of the target program and the likelihood of detecting program crashes.
机译:模糊测试的成功很大程度上取决于用于测试生成的种子输入的质量。但是,组成种子输入的语料库以实现目标程序的高代码和行为覆盖率是非常困难的,特别是当目标程序需要复杂的输入格式(例如PDF文件)时。我们提出了一个基于机器学习的框架,以提高将PDF文件作为输入的模糊程序的种子输入质量。给定初始的种子PDF文件集,我们的框架利用一组神经网络来1)发现这些PDF文件与目标程序中的执行之间的相关性,以及2)利用这种相关性生成更可能探索的新种子文件目标程序中的新路径。我们在一组广泛使用的PDF查看器上进行的实验表明,由我们的框架产生的改进的种子输入可以显着提高目标程序的代码覆盖率和检测程序崩溃的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号