首页> 外文会议>2019 12th IEEE Conference on Software Testing, Validation and Verification >Comparing Mutation Testing at the Levels of Source Code and Compiler Intermediate Representation
【24h】

Comparing Mutation Testing at the Levels of Source Code and Compiler Intermediate Representation

机译:在源代码和编译器中间表示级别比较变异测试

获取原文
获取原文并翻译 | 示例

摘要

Mutation testing is widely used in research for evaluating the effectiveness of test suites. There are multiple mutation tools that perform mutation at different levels, including traditional mutation testing at the level of source code (SRC) and more recent mutation testing at the level of compiler intermediate representation (IR). This paper presents an extensive comparison of mutation testing at the SRC and IR levels, specifically at the C programming language and the LLVM compiler IR levels. We use a mutation testing tool called SRCIROR that implements conceptually the same mutation operators at both levels. We also employ automated techniques to account for equivalent and duplicated mutants, and to determine minimal and surface mutants. We carry out our study on 15 programs from the Coreutils library. Overall, we find mutation testing to be better at the SRC level: the SRC level produces much fewer mutants and is thus less expensive, but the SRC level still generates a similar number of minimal and surface mutants, and the mutation scores at both levels are very closely correlated. We also perform a case study on the Space program to evaluate which level's mutation score correlates better with the actual fault-detection capability of test suites sampled from Space's test pool. We find the mutation score at both levels to not be very correlated with the actual fault-detection capability of test suites.
机译:变异测试在研究中广泛用于评估测试套件的有效性。有多种变异工具可以在不同级别执行变异,包括在源代码(SRC)级别进行传统的变异测试,以及在编译器中间表示(IR)级别进行最新的变异测试。本文对SRC和IR级别的突变测试进行了广泛的比较,特别是在C编程语言和LLVM编译器IR级别。我们使用一种称为SRCIROR的变异测试工具,该工具在概念上在两个级别上都实现了相同的变异运算符。我们还采用自动化技术来解释等效和重复的突变体,并确定最小和表面突变体。我们对Coreutils库中的15个程序进行了研究。总的来说,我们发现突变测试在SRC级别上更好:SRC级别产生的突变体少得多,因此价格便宜,但是SRC级别仍然产生相似数量的最小和表面突变体,并且两个级别的突变得分均为紧密相关。我们还对Space程序进行了案例研究,以评估哪个级别的突变得分与从Space的测试池中采样的测试套件的实际故障检测能力更好地相关。我们发现这两个级别的突变评分与测试套件的实际故障检测能力都没有很大的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号