首页> 外文会议>IEEE International Conference on Computer Science and Information Technology >An Approach for Assessing Similarity Metrics Used in-Metric-based CloneDetection Techniques
【24h】

An Approach for Assessing Similarity Metrics Used in-Metric-based CloneDetection Techniques

机译:一种评估基于度量基于公制的Clonedetection技术的相似度量的方法

获取原文

摘要

Similarity is an important concept in information theory. A challenging question is how to measure the amount of shared information between two systems. A large number of metrics are proposed and used to measure similarity between two computer programs or two portions of the same program. In this paper, we present an approach for assessing which metrics are most useful for similarity prediction in the context of clone detection. The presented approach uses clustering to identify clone candidates. In the experiments conducted, we applied sequential clustering using all possible permutations of a subset of the metrics used in metric-based clone detection literature. Precision and recall are calculated in every experiment. Experimental results show that the order of the metrics used affects the results dramatically. This shows that the used metrics are of variable relevance.
机译:相似性是信息理论中的重要概念。一个具有挑战性的问题是如何衡量两个系统之间的共享信息量。提出了大量指标并用于测量两个计算机程序之间的相似性或相同程序的两个部分。在本文中,我们提出了一种评估在克隆检测的背景下对相似性预测最有用的度量最有用的方法。呈现的方法使用聚类来识别克隆候选者。在进行的实验中,我们使用基于度量的克隆检测文献中使用的度量的所有可能的置换来施加顺序聚类。在每个实验中计算精度和召回。实验结果表明,所用度量的顺序显着影响结果。这表明使用的度量是可变相关性的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号