【24h】

On Precision of Code Clone Detection Tools

机译:关于代码克隆检测工具的精度

获取原文

摘要

Precision and recall are the main metrics used to measure the correctness of clone detectors. These metrics require the existence of labeled datasets containing the ground truth – samples of clone and non-clone pairs. For source code clone detectors, in particular, there are some techniques, as well as a concrete framework, for automatically evaluating recall, down to different types of clones. However, evaluating precision is still challenging, because of the intensive and specialized manual effort required to accomplish the task. Moreover, when precision is reported, it is typically done over all types of clones, making it hard to assess the strengths and weaknesses of the corresponding clone detectors.This paper presents systematic experiments to evaluate precision of eight code clone detection tools. Three judges independently reviewed 12,800 clone pairs to compute the undifferentiated and type-based precision of these tools. Besides providing a useful baseline for future research in code clone detection, another contribution of our work is to unveil important considerations to take into account when doing precision measurements and reporting the results. Specifically, our work shows that the reported precision of these tools leads to significantly different conclusions and insights about the tools when different types of clones are taken into account. It also stresses, once again, the importance of reporting inter-rater agreement.
机译:准确性和召回率是用来衡量克隆检测器正确性的主要指标。这些指标要求存在包含基本事实的标记数据集-克隆对和非克隆对的样本。特别是对于源代码克隆检测器,有一些技术以及具体的框架可用于自动评估召回,直至不同类型的克隆。但是,由于完成任务需要大量且专门的人工工作,因此评估精度仍然具有挑战性。此外,当报告精确度时,通常会在所有类型的克隆上完成,因此很难评估相应克隆检测器的优缺点。本文提出了系统的实验,以评估八种代码克隆检测工具的准确性。三名法官独立审查了12,800个克隆对,以计算这些工具的未区分和基于类型的精度。除了为将来的代码克隆检测研究提供有用的基准外,我们的工作的另一贡献是揭示了进行精确测量和报告结果时要考虑的重要考虑因素。具体来说,我们的工作表明,当考虑到不同类型的克隆时,所报道的这些工具的精确度会导致截然不同的结论和见解。它还再次强调报告评估者间协议的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号