首页> 外文会议>2018 IEEE/ACM 40th International Conference on Software Engineering >Journal First Overfitting in Semantics-Based Automated Program Repair
【24h】

Journal First Overfitting in Semantics-Based Automated Program Repair

机译:基于杂志的第一篇基于语义的自动程序修复中的过拟合

获取原文
获取原文并翻译 | 示例

摘要

The primary goal of Automated Program Repair (APR) is to automatically fix buggy software, to reduce the manual bug-fix burden that presently rests on human developers. Existing APR techniques can be generally divided into two families: semantics-vs. heuristics-based. Semantics-based APR uses symbolic execution and test suites to extract semantic constraints, and uses program synthesis to synthesize repairs that satisfy the extracted constraints. Heuristic-based APR generates large populations of repair candidates via source manipulation, and searches for the best among them. Both families largely rely on a primary assumption that a program is correctly patched if the generated patch leads the program to pass all provided test cases. Patch correctness is thus an especially pressing concern. A repair technique may generate overfitting patches, which lead a program to pass all existing test cases, but fails to generalize beyond them. In this work, we revisit the overfitting problem with a focus on semantics-based APR techniques, complementing previous studies of the overfitting problem in heuristics-based APR. We perform our study using IntroClass and Codeflaws benchmarks, two datasets well-suited for assessing repair quality, to systematically characterize and understand the nature of overfitting in semantics-based APR. We find that similar to heuristics-based APR, overfitting also occurs in semantics-based APR in various different ways.
机译:程序自动修复(APR)的主要目标是自动修复有问题的软件,以减少目前应由开发人员承担的手动错误修复负担。现有的APR技术通常可以分为两个家族:语义vs。基于启发式的。基于语义的APR使用符号执行和测试套件来提取语义约束,并使用程序合成来合成满足所提取约束的修复。基于启发式的APR可通过源操纵来生成大量的维修候选者,并在其中寻找最佳的候选者。这两个家族都主要基于一个基本假设,即如果生成的补丁导致程序通过了所有提供的测试用例,则该程序将被正确修补。因此,补丁的正确性是一个特别紧迫的问题。修复技术可能会生成过度拟合的补丁,从而导致程序通过所有现有的测试用例,但无法将其推广。在这项工作中,我们将重点放在基于语义的APR技术上来重新研究过度拟合问题,以补充先前对基于启发式APR中的过度拟合问题的研究。我们使用IntroClass和Codeflaws基准(这两个非常适合评估修复质量的数据集)进行研究,以系统地表征和了解基于语义的APR中过拟合的性质。我们发现,类似于基于启发式的APR,过拟合也以各种不同的方式出现在基于语义的APR中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号