...
首页> 外文期刊>Science of Computer Programming >Viewing functions as token sequences to highlight similarities in source code
【24h】

Viewing functions as token sequences to highlight similarities in source code

机译:查看功能作为令牌序列以突出显示源代码中的相似之处

获取原文
获取原文并翻译 | 示例
           

摘要

The detection of similarities in source code has applications not only in software re-engineering (to eliminate redundancies) but also in software plagiarism detection. This later can be a challenging problem since more or less extensive edits may have been performed on the original copy: insertion or removal of useless chunks of code, rewriting of expressions, transposition of code, inlining and outlining of functions, etc. In this paper, we propose a new similarity detection technique not only based on token sequence matching but also on the factorization of the function call graphs. The factorization process merges shared chunks (factors) of codes to cope, in particular, with inlining and outlining. The resulting call graph offers a view of the similarities with their nesting relations. It is useful to infer metrics quantifying similarity at a function level.
机译:源代码中相似性的检测不仅在软件重新设计(以消除冗余)方面具有应用,而且在软件抄袭检测中也具有应用。由于对原始副本或多或少地进行了广泛的编辑,因此以后可能会成为一个挑战性的问题:插入或删除无用的代码块,表达式的重写,代码的换位,函数的内联和外联等。 ,我们提出了一种新的相似性检测技术,它不仅基于令牌序列匹配,而且还基于函数调用图的因式分解。分解过程合并了共享的代码块(因子)以应对,尤其是内联和概述。生成的调用图提供了相似性及其嵌套关系的视图。推断在功能级别量化相似性的度量很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号