SOURCE CODE SIMILARITY DETECTION USING DIGITAL FINGERPRINTS

Abstract

Similarity in source code is identified by searching digital fingerprints that represent at least the control flow of blocks of programming statements. At least some of the source code is converted into a plurality of respective tokens, each of which is associated with a plurality of blocks. The tokens are modified by normalizing at least one value in at least one of the blocks and/or by defining at least one abstraction. A representation of control flow is then created, and a digital fingerprint representing at least the control flow of a token is generated. Source code within at least one block of a given token is then identified as a duplicate of source code stored in a repository by comparing at least one of the generated digital fingerprints with at least one previously generated digital fingerprint.
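The steps in the abstract can be illustrated with a small sketch. It is not the patented method: it assumes Python input, treats each function definition as one block, uses Python's `ast` module in place of the patent's tokenizer, normalizes by discarding identifiers and literal values, and hashes the resulting control-flow shape with SHA-256. The names `CONTROL_NODES`, `control_flow_shape`, `fingerprint_blocks`, and `find_duplicates` are hypothetical and chosen only for this illustration.

```python
import ast
import hashlib

# Node types counted as control flow in this sketch (an assumption, not the patent's list).
CONTROL_NODES = (ast.If, ast.For, ast.While, ast.Try, ast.With,
                 ast.Return, ast.Break, ast.Continue)


def control_flow_shape(node):
    """Return a nested string of control-flow node names found under `node`.

    Identifiers and literal values never enter the string, which acts as
    the normalization step in this sketch.
    """
    parts = []
    for child in ast.iter_child_nodes(node):
        inner = control_flow_shape(child)
        if isinstance(child, CONTROL_NODES):
            parts.append(f"{type(child).__name__}({inner})")
        elif inner:
            parts.append(inner)
    return ",".join(parts)


def fingerprint_blocks(source):
    """Map each function name in `source` to a SHA-256 digest of its control-flow shape."""
    fingerprints = {}
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.FunctionDef):
            shape = control_flow_shape(node)
            fingerprints[node.name] = hashlib.sha256(shape.encode("utf-8")).hexdigest()
    return fingerprints


def find_duplicates(candidate_source, repository):
    """Return (candidate_block, stored_block) pairs whose fingerprints are equal."""
    by_digest = {}
    for name, digest in repository.items():
        by_digest.setdefault(digest, []).append(name)
    matches = []
    for name, digest in fingerprint_blocks(candidate_source).items():
        for stored in by_digest.get(digest, []):
            matches.append((name, stored))
    return matches


if __name__ == "__main__":
    stored = (
        "def total(xs):\n"
        "    s = 0\n"
        "    for x in xs:\n"
        "        if x > 0:\n"
        "            s += x\n"
        "    return s\n"
    )
    candidate = (
        "def sum_positive(values):\n"
        "    acc = 0\n"
        "    for v in values:\n"
        "        if v > 10:\n"
        "            acc += v\n"
        "    return acc\n"
        "\n"
        "def first(values):\n"
        "    return values[0]\n"
    )
    repository = fingerprint_blocks(stored)
    print(find_duplicates(candidate, repository))  # [('sum_positive', 'total')]
```

Renamed variables and a changed constant do not affect the match, because only the control-flow shape is hashed. The same property makes trivial blocks collide (every function consisting of a single `return` shares one fingerprint), so a practical system would likely combine the control-flow fingerprint with further features.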

Bibliographic data

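  • Publication number: US2020218535A1
  • Publication date: 2020-07-09
  • Original document format: PDF
  • Applicant/assignee: SAUDI ARABIAN OIL COMPANY
  • Application number: US201916242414
  • Inventor: FIRAS ALOMARI
  • Filing date: 2019-01-08
  • Classification: G06F8/75; G06F8/41; G06F16/901
  • Country: US
  • Added to database: 2022-08-21 11:20:12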
