首页> 外文会议>IEEE International Conference on Software Maintenance and Evolution >Revisiting Test Smells in Automatically Generated Tests: Limitations, Pitfalls, and Opportunities
【24h】

Revisiting Test Smells in Automatically Generated Tests: Limitations, Pitfalls, and Opportunities

机译:在自动生成的测试中重温测试的气味:限制,陷阱和机会

获取原文
获取外文期刊封面目录资料

摘要

Test smells attempt to capture design issues in test code that reduce their maintainability. Previous work found such smells to be highly common in automatically generated test-cases, but based this result on specific static detection rules; although these are based on the original definition of "test smells", a recent empirical study showed that developers perceive these as overly strict and non-representative of the maintainability and quality of test suites. This leads us to investigate how effective such test smell detection tools are on automatically generated test suites. In this paper, we build a dataset of 2,340 test cases automatically generated by EVOSUITE for 100 Java classes. We performed a multi-stage, cross-validated manual analysis to identify six types of test smells and label their instances. We benchmark the performance of two test smell detection tools: one widely used in prior work, and one recently introduced with the express goal to match developer perceptions of test smells. Our results show that these test smell detection strategies poorly characterized the issues in automatically generated test suites; the older tool’s detection strategies, especially, misclassified over 70% of test smells, both missing real instances (false negatives) and marking many smell-free tests as smelly (false positives). We identify common patterns in these tests that can be used to improve the tools, refine and update the definition of certain test smells, and highlight as of yet uncharacterized issues. Our findings suggest the need for (i) more appropriate metrics to match development practice; and (ii) more accurate detection strategies, to be evaluated primarily in industrial contexts.
机译:测试气味试图捕获测试代码中的设计问题,从而降低其可维护性。先前的工作发现这种气味在自动生成的测试用例中非常普遍,但是基于特定的静态检测规则得出了这种结果。尽管这些是基于“测试气味”的原始定义,但是最近的一项实证研究表明,开发人员认为这些是对测试套件的可维护性和质量的过于严格且不具有代表性的代表。这使我们研究了这种测试气味检测工具在自动生成的测试套件上的有效性。在本文中,我们建立了由EVOSUITE自动为100个Java类生成的2,340个测试用例的数据集。我们执行了多阶段,经过交叉验证的手动分析,以识别六种类型的测试气味并标记其实例。我们以两种测试气味检测工具的性能为基准:一种在以前的工作中广泛使用,另一种是最近推出的,其明确的目标是使开发人员对测试气味的感知与之相匹配。我们的结果表明,这些测试气味检测策略不能很好地描述自动生成的测试套件中的问题。尤其是旧工具的检测策略,将70%以上的测试气味错误分类,既缺少真实实例(假阴性),又将许多无气味的测试标记为有臭味(假阳性)。我们在这些测试中确定了可用于改进工具,改进和更新某些测试气味的定义以及突出尚未发现的问题的常见模式。我们的发现表明,有必要(i)制定更合适的指标以匹配开发实践; (ii)更准确的检测策略,主要在工业环境中进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号