IEEE/ACM International Conference on Automated Software Engineering

Revisiting the Relationship Between Fault Detection, Test Adequacy Criteria, and Test Set Size

Abstract

The research community has long recognized a complex interrelationship between fault detection, test adequacy criteria, and test set size. However, there is substantial confusion about whether and how to experimentally control for test set size when assessing how well an adequacy criterion is correlated with fault detection and when comparing test adequacy criteria. Resolving the confusion, this paper makes the following contributions: (1) A review of contradictory analyses of the relationships between fault detection, test adequacy criteria, and test set size. Specifically, this paper addresses the supposed contradiction of prior work and explains why test set size is neither a confounding variable, as previously suggested, nor an independent variable that should be experimentally manipulated. (2) An explication and discussion of the experimental designs of prior work, together with a discussion of conceptual and statistical problems, as well as specific guidelines for future work. (3) A methodology for comparing test adequacy criteria on an equal basis, which accounts for test set size without directly manipulating it through unrealistic stratification. (4) An empirical evaluation that compares the effectiveness of coverage-based testing, mutation-based testing, and random testing. Additionally, this paper proposes probabilistic coupling, a methodology for assessing the representativeness of a set of test goals for a given fault and for approximating the fault-detection probability of adequate test sets.