International Workshop on Software Clones

SemanticCloneBench: A Semantic Code Clone Benchmark using Crowd-Source Knowledge

Abstract

Both newly proposed code clone detection techniques and existing techniques and tools need to be evaluated and compared. This evaluation can be done by manually assessing the reported clones or by using benchmarks. Available benchmarks have four main limitations: they are restricted to one programming language; they contain a limited number of clone pairs, confined to the selected system(s); they require manual validation; and they do not support all types of code clones. To overcome these limitations, we propose a methodology to generate a wide range of semantic clone benchmarks for different programming languages with minimal human validation. Our technique is based on the knowledge provided by developers who participate in the crowd-sourced information website Stack Overflow. We applied automatic filtering, selection, and validation to the source code in Stack Overflow answers. Finally, we built a semantic code clone benchmark of 4,000 clone pairs for the languages Java, C, C#, and Python.
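
The abstract does not spell out the filtering and selection rules, so the following is only a rough illustration of what such a step might look like, not the authors' pipeline. It assumes (hypothetically) that code snippets from answers to the same Stack Overflow question are candidate semantic clones, that very short snippets are dropped, and that textually near-identical pairs are excluded. The function name `candidate_clone_pairs` and the thresholds `min_lines` and `max_similarity` are invented for this sketch.

```python
from difflib import SequenceMatcher
from itertools import combinations


def candidate_clone_pairs(answers, min_lines=5, max_similarity=0.7):
    """Return (question_id, snippet_a, snippet_b) candidate pairs.

    answers: iterable of (question_id, code) tuples.
    Assumption (not from the abstract): snippets answering the same
    question implement the same functionality.
    """
    # Filtering: keep only snippets with enough lines to be meaningful.
    by_question = {}
    for question_id, code in answers:
        if len(code.strip().splitlines()) >= min_lines:
            by_question.setdefault(question_id, []).append(code)

    # Selection: pair snippets within a question, skipping pairs that
    # are textually too similar to count as semantic (rather than
    # syntactic) clones.
    pairs = []
    for question_id, snippets in by_question.items():
        for a, b in combinations(snippets, 2):
            if SequenceMatcher(None, a, b).ratio() <= max_similarity:
                pairs.append((question_id, a, b))
    return pairs
```

Pairs produced this way would still need validation, which the abstract says was done with minimal human effort.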