首页> 外文期刊>IEEE transactions on dependable and secure computing >Benchmarking the Capability of Symbolic Execution Tools with Logic Bombs
【24h】

Benchmarking the Capability of Symbolic Execution Tools with Logic Bombs

机译:用逻辑炸弹基准符号执行工具的能力

获取原文
获取原文并翻译 | 示例

摘要

Symbolic execution has become an indispensable technique for software testing and program analysis. However, since several symbolic execution tools are presently available off-the-shelf, there is a need for a practical benchmarking approach. This paper introduces a fresh approach that can help benchmark symbolic execution tools in a fine-grained and efficient manner. The approach evaluates the performance of such tools against known challenges faced by general symbolic execution techniques, e.g., floating-point numbers and symbolic memories. We first survey related papers and systematize the challenges of symbolic execution. We extract 12 distinct challenges from the literature and categorize them into two categories: symbolic-reasoning challenges and path-explosion challenges. Next, we develop a dataset of logic bombs and a framework for benchmarking symbolic execution tools automatically. For each challenge, our dataset contains several logic bombs, each addressing a specific challenging problem. Triggering one or more logic bombs confirms that the symbolic execution tool in question is able to handle the corresponding problem. Real-world experiments with three popular symbolic execution tools, namely, KLEE, angr, and Triton have shown that our approach can reveal the capabilities and limitations of the tools in handling specific issues accurately and efficiently. The benchmarking process generally takes only a few dozens of minutes to evaluate a tool. We have released our dataset on GitHub as open source, with an aim to better facilitate the community to conduct future work on benchmarking symbolic execution tools.
机译:符号执行已成为软件测试和程序分析的必不可少的技术。但是,由于目前地提供了几种符号执行工具,因此需要实用的基准测试方法。本文介绍了一种新的方法,可以通过细粒度和有效的方式帮助基准符号执行工具。该方法评估这种工具的性能与一般符号执行技术,例如浮点数和符号存储器面临的已知挑战。我们第一次调查相关论文并系统化象征性执行的挑战。我们从文献中提取了12个不同的挑战,并将它们分为两类:象征性的挑战和路径爆炸挑战。接下来,我们开发逻辑BOMB的数据集和自动基准标记符号执行工具的框架。对于每个挑战,我们的数据集包含几个逻辑BOMB,每个逻辑BOMB都解决了特定的具有挑战性问题。触发一个或多个逻辑BOMBS确认有问题的符号执行工具能够处理相应的问题。具有三个流行的符号执行工具,即Klee,Angr和Triton的现实世界实验表明,我们的方法可以准确且有效地揭示处理特定问题的工具的能力和限制。基准过程通常只需要几十分钟的时间来评估工具。我们已将Github上的数据集发布为开源,旨在更好地促进社区对基准测试符号执行工具的未来工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号